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EDITORIAL 


The future of ocean health 


uman and environmental health are inextricably 
linked. Yet ocean ecosystem health is declining 
because of anthropogenic pollution, overexploi- 
tation, and the effects of global climate change. 
These problems affect billions of people depen- 
dent on oceans for their lives, livelihoods, and cul- 
tural practices. The importance of ocean health is 
recognized by scientists, managers, policy-makers, non- 
governmental organizations, and stakeholders including 
fishers, recreationalists, and cultural practitioners. So 
why are the oceans still degrading? Sustainable care of 
this vast resource needs a new approach if future genera- 
tions are to inherit a legacy of vital marine ecosystems. 
Although countries pledge to protect ocean health 
through international agreements, including the United 
Nations Convention on the Law of the Sea and the Lon- 
don Convention and Protocol, 
compliance and enforcement are 
inadequate, as nations and indus- 
tries find loopholes to circumvent 
the spirit of essential guiding 
principles. The use of oceans as 
a dumping ground for countless 
wastes continues as a result. This 
is inexcusable. Governance re- 
quires strategies that are effective, 
economically feasible, and cultur- 
ally acceptable for diverse users. 
Many ocean pollutants come 
from incidental land runoff, unauthorized discharges, and 
maritime accidents that release pesticides, heavy metals, 
plastics, chemicals, and oil. Others arise from planned 
discharges of treated sewage and industrial wastes, such 
as last week’s controversial release of radionuclide-con- 
taining water from the Fukushima Daiichi nuclear power 
plant. The belief that the oceans are limitless in their 
capacity to absorb pollutants, while still able to produce 
abundant food and support complex ecosystems, is false. 
Although pollutant concentrations within “accepted 
standards” may seem small when considering the vast 
volume of the oceans, many toxicants, even at low con- 
centrations, can cause sublethal responses at the cellular, 
organismal, population, and ecosystem levels that affect 
ecosystem structure, function, and services. Also, con- 
taminants act differently in combination, with cumula- 
tive effects on ocean life, often making the “soup” more 
important than the individual pollutants. Moreover, not 
all pollutants share the same fate, as some move with cur- 
rents, others bind to seafloor sediments, some adhere to 
organic particles, and others are transported and bioaccu- 
mulated by marine organisms. Even in a single organism, 


“The use of oceans 
as a dumping ground 


for countless 
wastes continues...” 


pollutants are bioconcentrated in different organs, bones, 
and tissues, creating a range of potential toxic effects. 

Effective policies and actions to protect the ocean from 
pollution require sound science. Indeed, the United Na- 
tions declared the years 2021 to 2030 as the Decade of 
Ocean Science for Sustainable Development, with the 
goal of reversing declines in ocean health. The proclama- 
tion points to pursuing “the science we need for the ocean 
we want.’ This is valid, but science alone is not enough. 

In addition to closing environmental policy, regula- 
tory, and enforcement loopholes, political will is essen- 
tial to bring about healthier oceans. This could be better 
achieved by integrating contemporary science with Indig- 
enous knowledge about ocean stewardship. This knowl- 
edge integration has been leveraged to address issues of 
carrying capacity (the maximum amount of life that an 
ocean ecosystem can sustainably 
support), but has not evolved to 
tackle mounting, large-scale prob- 
lems of transboundary pollution 
and global climate change. For 
example, the 18 sovereign nations 
of the Pacific Islands Forum have 
demonstrated forward-thinking 
cooperative leadership by develop- 
ing a “2050 Strategy for the Blue 
Pacific Continent” to protect the 
marine environment using inte- 
grated knowledge. This regional 
shared vision provides a potential blueprint for actions 
that other nations, in partnership, could pursue. 

The Small Island Developing States (also known as 
“Large Ocean States”) are part of a group of 57 nations 
and political entities that collectively have jurisdiction 
over 140 million square kilometers of ocean, or one-quar- 
ter of the world’s exclusive economic zones. This gives 
them control over natural resources in these waters and 
as such, they wield a large impact on, and an outsized 
stake in, protecting marine ecosystems. Addressing ocean 
health over multigenerational timescales is an important 
characteristic of Indigenous cultures. Native communities 
have a strong connection to place, and hence ownership 
of problems and solutions. This ethic supported passage 
of the UN “high seas” treaty, which requires assessing the 
environmental impacts of activities in areas beyond na- 
tional jurisdictions, with an emphasis on cleaner oceans. 

Integrating contemporary science and _ traditional 
knowledge could strengthen treaties and agreements and 
forge new shared visions among nations. It’s not too late 
to set the world on a better path to restore ocean health. 

—-Robert Richmond and Ken Buesseler 
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1,870,000 


Estimated excess deaths from all causes among those 30 years and 
older in China when COVID-19 surged in the 2 months after the end of its 
“zero COVID” policy in December 2022. (JAMA Network Open) 
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India’s lunar rover, Pragyan, shown descending to the surface, is named for “wisdom” in Sanskrit. 


SPACE SCIENCE 


Indian Moon lander begins search for ice 


ndia last week put a spacecraft on the Moon, making it the fourth 

nation to perform the feat, after the United States, the Soviet 

Union, and China. The $75 million, robotic Chandrayaan-3 mis- 

sion is the first to explore a region near the lunar south pole, 

which is thought to hold frozen water that could be used to sup- 

port human explorers. The area also contains large impact craters, 
which could hold clues to the history of the Solar System. Hours after 
landing, the craft released a solar-powered, suitcase-size rover that 
will study soil composition for at least 2 weeks. The mission is a vin- 
dication for the Indian Space Research Organisation, which failed to 
complete its previous Moon-landing attempt in 2019. 


UNC materials scientist slain 


WORKPLACE | A University of North 
Carolina (UNC) at Chapel Hill graduate 
student is accused of shooting dead his aca- 
demic adviser, applied physicist Zijie Yan, in 
a campus research building this week. The 
student, Tailei Qi, was apprehended off cam- 
pus 95 minutes after the killing. He has been 
charged with first-degree murder. The shoot- 
ing prompted a 3-hour lockdown on the 
flagship campus. The Caudill Labs building, 
the site of the shooting, houses chemistry 
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laboratories and some in applied physical 
sciences, where Yan worked. Chinese-born, 
he was a graduate of Huazhong University 
of Science and Technology. He then came 
to the U.S. and earned a Ph.D. in materi- 
als engineering at Rensselaer Polytechnic 
Institute. He had been at UNC since 

2019. Yan and his accused killer, who 
complained online about his working life, 
co-authored a paper as recently as July. 

Qi joined the Yan lab in January 2022 after 
earning degrees from Wuhan University and 
Louisiana State University. 


Masks, lockdowns vindicated 


PUBLIC HEALTH | Wearing masks, social 
distancing, and other measures taken 
during the COVID-19 pandemic were 
effective at slowing viral spread, accord- 
ing to a comprehensive report released 
last week by the Royal Society. It 
describes hundreds of observational stud- 
ies and a limited number of randomized 
trials of six nonpharmaceutical interven- 
tions, including contact tracing, indoor 
ventilation, travel restrictions, and public 
health messaging, used around the world. 
All were found to help prevent transmis- 
sion, with social distancing, including 
lockdowns, coming out on top. The 
usefulness of travel restrictions varied, 
with quarantining policies implemented 
early in the pandemic scoring higher 
than pretravel screening of passengers for 
high temperatures or other symptoms. 
The report recommends that to prepare 
for future pandemics, researchers should 
collect data about other effects of such 
interventions, including impacts on 
people’s income and mental health. 


EPA restricts common pesticide 


ENVIRONMENT | The U.S. Environmental 
Protection Agency (EPA) last week put new 
limits on malathion, an insecticide com- 
monly used to control mosquitoes, aphids, 
and other pests, after a federal evaluation 
determined it could harm 78 endangered 
terrestrial species. Manufacturers must 
revise product labels to include new 
instructions, such as reducing spraying 
near critical habitat of protected species 
such as the sandhill crane and Alabama 
cavefish. The Center for Biological 
Diversity, which sued EPA more than 

a decade ago to change the labeling, 
complained about loopholes. Further 
restrictions could come by the end of the 
year, after the National Marine Fisheries 
Service completes an additional evaluation 
of malathion’s risk to endangered species 
living in coastal waters. 


Thefts hit British Museum 


ARCHAEOLOGY | The disappearance of 
about 1500 artifacts from the British 
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Gardeners uiged to plant entange 


ardens and apartment balconies could make a significant 
contribution to saving endangered plants, according to a study 
of such species in Germany. Scientists examined the plant spe- 
cies that are considered vulnerable, threatened, endangered, or 
extinct in the wild and found that nearly half could be grown in 
gardens or public green spaces or on green roofs or balconies. They 
also found that two-thirds of the 988 native, at-risk, garden-friendly 


species they identified are currently available to buy from nurseries 
and seed companies. If more gardeners opted to plant those species, 
the overall “threat status” for all plants in Germany—the ratio of the 
number of at-risk species to all species assessed for risk—could fall 
by 25%, the study’s authors report this week in Scientific Reports. 
They also found that, on average, the at-risk species tolerate drought 
better and need less fertilizer than conventional garden plants. 


Clammy campion (Viscaria vulgaris Bernh., rosy flowers) and Nottingham catchfly (Silene nutans L., white) are among plant species at risk in parts of Germany. 


Museum prompted its director to resign 
last week and many to question how a 
prestigious institution could allow so 
many of its treasures to be lost or stolen. 
Among the missing items are jewelry, 
gems, and glass dated to the 15th century 
B.C.E., The Guardian reported. An internal 
investigation by the museum had accused 
one of its longtime curators of improp- 
erly categorizing antiquities and then 
selling them to private buyers, The Daily 
Telegraph reported. The museum’s direc- 
tor, Hartwig Fischer, said in a 25 August 
statement of resignation that he accepted 
responsibility for mismanagement and 

a failure to respond to warnings about 
potential thefts in 2021. The museum has 
yet to release a comprehensive inventory of 
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the missing artifacts as police investigate 
their whereabouts. 


Costly questionnaire challenged 


PUBLIC HEALTH | The Journal of Clinical 
Hypertension has retracted a 2008 article 
that provided the empirical foundation for a 
questionnaire whose co-creator billed some 
researchers several thousand dollars each to 
use. Public health specialist Donald Morisky 
of the University of California, Los Angeles, 
cited his copyright of the instrument, which 
helps predict how likely patients are to 
adhere to a drug regimen. Rather than pay, 
several scientists chose to retract publica- 
tions that used the questionnaire. The 
retraction of Morisky’s paper, announced 


IN FOCUS A micrograph of 
a fossilized blood vessel 
from a 150-million-year- 
old diplodocid dinosaur 
was a runner-up awardee 
in this year’s BMC Ecology 


and Evolution photography 
competition. The image 
was taken by molecular 
paleobiologist Jasmina 
Wiemann of the Field 
Museum of Natural History. 


in August and reported last week by 
Retraction Watch, comes nearly 4 years 
after a scientist not involved in the research 
told the journal the math in the paper was 
implausible. In its retraction notice, the 
journal said it conducted an independent 
statistical review and concluded that the 
article’s results were misleading. 


Women pass on elite journals 


PUBLISHING | Men are more likely 

than women to submit their scientific 
manuscripts to Cell, Nature, or Science, 
according to a study of 4740 research- 
ers. Women may “have a higher internal 
standard for what constitutes novel 
research,” write the authors of the study, 
posted as a preprint last week on bioRxiv, 
with potential implications for broader 
gender disparities in academic science. 
The study team surveyed authors who 
published research from 2008 to 2017, 
asking about their manuscript submission 
experiences and academic rank. Men and 
women rated the quality of their research 
similarly. But when asked why they didn’t 
submit their work to an elite journal, 
women were more likely than men to say 
it was not groundbreaking or sufficiently 
novel. To address the gap, journal editors 
could do more outreach to women-led labs 
to encourage submission of their work, the 
authors write. 
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Hot weight loss drugs tested against addiction 


Clinical trials will gauge whether GLP-I analogs curb drug and alcohol cravings 


By Mitch Leslie 


hen the diabetes treatments known 

as GLP-1 analogs reached the mar- 

ket in 2005, doctors advised pa- 

tients taking the drugs that they 

might lose a small amount of 

weight. Talk about an understate- 
ment. Obese people can drop more than 15% 
of their body weight, studies have found, and 
two of the medications are now approved by 
the U.S. Food and Drug Administration (FDA) 
for weight reduction. A surge in demand for 
the drugs as slimming treatments has led to 
shortages. “This class of drugs is exploding in 
popularity,’ says clinical psychologist Joseph 
Schacht of the University of Colorado School 
of Medicine. 

But patient reports and animal studies 
have yielded tantalizing signs that the drugs 
may spur another unexpected and welcome 
effect: fighting addiction. Most early trials 
were disappointing, but they used less po- 
tent versions of the drugs. Now, at least nine 
phase 2 clinical trials are underway or being 
planned to test whether the more power- 
ful compound semaglutide and its chemical 
cousins can help patients curb their use of 
cigarettes, alcohol, opioids, or cocaine. Hopes 
are high. Semaglutide (sold under the trade 
names Wegovy, Ozempic, and Rybelsus) “is 
truly the most exciting drug for the last few 
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decades,” says neuropharmacologist Leandro 
Vendruscolo of the U.S. National Institute on 
Drug Abuse. 

If the results of the new trials are positive, 
addiction science could have its own “Prozac 
moment,’ says clinical neuroscientist W. Kyle 
Simmons of the Oklahoma State University 
Center for Health Sciences. In the 1980s, that 
drug brought a sea change to psychiatry, be- 
coming part of popular culture and leading 
to the wider use of antidepressants. 

Scientists have long been searching for 
new addiction drugs. Although FDA has ap- 
proved several, including three for patients 
with alcohol use disorder, these medicines 
only work for a small percentage of people 
who try them. And the pharmaceutical in- 
dustry has not delivered new compounds, 
in part because companies believe patients 
won't stick with treatments, making their 
development a poor investment, says clinical 
neuroscientist Lara Ray of the University of 
California, Los Angeles. The last “new” drug 
treatment for alcohol use disorder received 
FDA approval in 2006—and it was an inject- 
able version of a drug, naltrexone, that had 
been available since the 1980s. 

So when patients taking GLP-1 analogs for 
diabetes or weight loss reported that their 
hankering for substances such as alcohol and 
nicotine declined, researchers and doctors in 
the addiction field perked up. “You usually 


don’t hear people say that a drug makes them 
less interested in drinking,” Schacht says. 
Researchers are still probing how GLP-1 
analogs might pull off that feat. The drugs 
replicate the effects of the hormone glucagon- 
like peptide-1; by prodding its receptors in 
the pancreas, they stimulate release of in- 
sulin and trigger other beneficial responses, 
which explains how they help people with 
diabetes. But several structures in the brain 
also produce GLP-1 or carry receptors for the 
hormone—including brain areas that are in- 
volved in our reward pathways, which drive 
us to pursue pleasurable activities such as 
eating tasty food or hanging out with friends. 
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Addiction involves “hijacking of the reward . 


pathways in the brain; says behavioral 
neuroscientist Patricia Grigson of the Penn- 
sylvania State University College of Medicine. 
Researchers think GLP-1 analogs spur weight 
loss in part by quelling activity of this system, 
and the same mechanism could explain why 
people who take the medications report they 
are less motivated to drink and smoke. 
Studies in rodents and primates have sup- 
ported that mechanism and confirmed that 
the drugs diminish the desire for substances 
such as alcohol, fentanyl, nicotine, and her- 
oin. Clinical psychiatrist Anders Fink-Jensen 
of the University of Copenhagen and col- 
leagues even demonstrated that the drugs 
work in an incorrigible group of drinkers: 
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the monkeys living on the islands of St. Kitts 
and Nevis in the Caribbean. These rowdy pri- 
mates are notorious for heavy consumption 
of alcoholic drinks, which they often swipe 
from tourists. 

So far, however, only two clinical trials 
have suggested the drugs can curb addiction. 
In one, a team led by Luba Yammine, a clini- 
cian and researcher at the University of Texas 
Health Science Center at Houston, found in 
2021 that 46% of patients who wore nico- 
tine patches and received weekly injections 
of exenatide, a first-generation GLP-1 analog, 
stopped smoking, versus 27% of people who 
relied only on the patches. “That’s pretty good 
in the world of smoking cessation research,” 
Yammine says. A second preliminary trial for 
binge eating was also positive. 

Four other clinical trials came up empty. 
Fink-Jensen and colleagues performed one 
of these studies on 127 patients with alcohol 
use disorder. During the 6-month trial, all 
of the participants went through behavioral 
therapy to encourage them to drink less, 
and 62 patients received weekly injections 
of exenatide. Yet both groups cut their alco- 
hol consumption and the number of days on 
which they drank heavily by about the same 
amount, the researchers revealed last year. 
“That was a surprise for us,’ Fink-Jensen 
says. A trial for cocaine consumption was 
also negative—as were additional studies on 
smoking and binge eating. 

But all of the completed trials used older 
GLP-1 analogs, Fink-Jensen notes. Semaglu- 
tide binds more tightly to the GLP-1 recep- 
tor and induces greater weight loss than 
previous drugs. So Fink-Jensen and other 
researchers have launched new trials, almost 
all of which use semaglutide, hoping they will 
reveal greater capabilities against addiction, 
help identify which patients are most likely 
to benefit, and clarify how the drugs work. 

To try to nail down whether semaglutide 
quells cravings, Schacht and colleagues will 
ask participants with alcohol use disorder 
to watch as a research technician pours a 
brimming glass of their favorite drink. Af- 
ter raising the glass and sniffing the aroma, 
the patients will then rate their motiva- 
tion to imbibe. Participants in two other 
studies headed by clinical psychologist 
Christian Hendershot of the University of 
North Carolina School of Medicine will ac- 
tually be given the opportunity to smoke or 
drink alcohol in the lab. But they will try to 
abstain for as long as possible, earning an 
increasing amount of cash the longer they 
hold out. Offering participants the chance 
to indulge their cravings is advantageous, he 
says, because it “can improve sensitivity for 
detecting medication effects.” 

The current trials are also deploying 
techniques that measure brain activity, in- 
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cluding electroencephalography and func- 
tional MRI (fMRI), to look for clues to the 
drugs’ mechanisms. So far, only one pub- 
lished study—the clinical trial of exena- 
tide by Fink-Jensen and colleagues—has 
collected those data for patients receiving 
GLP-1 agonists as addiction treatments. The 
researchers used fMRI to determine the 
activity of three brain structures in the re- 
ward system while patients viewed photo- 
graphs of alcoholic drinks. The results were 
inconclusive. Compared with participants 
who took a placebo, patients on exenatide 
showed less activity in one structure—the 
ventral striatum—but activity in two other 
regions didn’t change. Researchers hope the 
new work will provide more clarity about 
the drugs’ effects on brain circuits. 

If GLP-1 analogs prove effective in the cur- 
rent phase 2 trials, access to the medications, 
which can cost more than $12,000 per year, 
could become a concern. Because of high 
costs and other barriers, patients with sub- 
stance use disorders often can’t access cur- 
rent addiction treatments, says psychologist 
Katie Witkiewitz of the University of New 
Mexico. “I don’t want [semaglutide] to be a 
medication that is only available to people 
who are wealthy.” 

In the United States, insurance might help 
if the drugs gain FDA approval as addiction 
treatments. That would require pharmaceu- 
tical companies to present data from larger, 
phase 3 trials. In the meantime, doctors 
could still prescribe the drugs off-label. But 
because insurance plans usually won’t cover 
off-label prescriptions, patients would have 
to foot the bill themselves. 

Researchers and doctors also fret about 
potential side effects—and not just the 
gastrointestinal problems such as nausea 
and vomiting the drugs typically cause. Ray 
wotries they might work so well that they 
crush patients’ enjoyment of life, a condi- 
tion called anhedonia that has been linked 
to depression, suicide, and relapse. And if 
patients are going to be on GLP-1 analogs 
for years, new problems could emerge. The 
once-weekly version of semaglutide that 
is so popular for weight loss only reached 
the market in 2021, notes molecular neuro- 
endocrinologist Giles Yeo of the University 
of Cambridge. “We need a better idea of 
long-term safety,’ he says. 

Doctors would face a further hurdle if the 
drugs gained approval for addiction therapy: 
determining how to work them into treat- 
ment programs. They won’t be the cure-alls 
that some popular articles imply, researchers 
warn. They might be more like Prozac and 
other antidepressants, which only work for a 
fraction of patients. But even that would be a 
boon, Ray says. “You can help a lot of people 
even if only one in five responds.” 


SCIENCE DIPLOMACY 


Tensions 
endanger U.S.- 
China research 
agreement 


Scientists on both sides 
welcome decision to 
continue talks on renewing 
collaboration 


By Jeffrey Mervis 


ising tensions between the United 
States and China could derail the re- 
newal of a 44-year-old agreement on 
scientific cooperation between the : 
two countries. 

Last week, U.S. President Joe Biden 
invited China to spend the next 6 months dis- 
cussing changes to the broad agreement, first 
signed in 1979, that enables joint research. © 
The move came after Biden rejected calls 
from some Republicans to let the pact lapse 
on 27 August, its planned expiration date. 

But the 6-month extension leaves little 
time to resolve a host of thorny issues, policy 
experts say. They include how to protect intel- 
lectual property rights to any findings, share 
data among collaborators, and ensure that 
research outcomes are fully reported. The 
Biden administration also faces calls to block * 


joint work on any technologies that could : 


have both civilian and military applications. 

Chinese officials welcomed the exten- 
sion. “As two major R&D countries, China 
and the United States should maintain con- 
tact and exchanges” in science and techno- 
logy (S&T), Chinese embassy spokesperson . 
Liu Pengyu said in a statement to Science. 
The pact’s history “has fully proved that 
China-U.S. exchanges and cooperation are 
mutually beneficial,” he said. 

Prominent U.S. scientists also applauded 
Biden’s move. “This agreement has been 
of enormous benefit to the United States,” 
wrote Stanford University physicists Steven 
Kivelson and Peter Michelson in a letter to 
Biden signed by more than 1000 scholars 
before the extension was announced. “We 
can attest that cutting off ties with China 
would directly and negatively impact our 
own research, the work of our immediate 
colleagues, and the educational mission of 
our universities.” 
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The pact, which has undergone revisions 
across several 5-year renewals, essentially 
says “that it’s OK with the U.S. government 
that [U.S. researchers] collaborate on science 
and technology with China, with appropriate 
selectivity and management,’ says physicist 
John Holdren of Harvard University, who 
served as science adviser to former President 
Barack Obama and helped negotiate an ex- 
tension in 2011. 

The US. has similar bilateral research 
agreements with some 60 countries. The one 
with China, commonly known as the Science 
and Technology Agreement (STA), neither 
provides funding for joint projects nor man- 
dates research in any particular sector. But 
it enables government agencies, universities, 
companies, and other entities in each nation 
to pursue joint research. Such projects have 
included a landmark clinical study showing 
how folic acid can prevent birth defects and 
a network of clean energy research centers. 

“Tt is particularly important, at a time of 
tension in so many dimensions of our rela- 
tionship with China, that we show that there 
are things worth preserving in that relation- 
ship,’ Holdren says. 

Congressional Republicans and _ others, 
however, say collaborating with China poses 
a major threat to U.S. economic and national 
security. “The Chinese Communist Party has 
abused the openness of the American scien- 
tific community to steal American research 
and coopt it for its own malign purposes,” 
Representative Mike Gallagher (R-WI), chair 
of the Select Committee on China in the U.S. 
House of Representatives, said after the ex- 
tension was announced. Two months ago, 
Gallagher and nine Republican colleagues 
wrote Secretary of State Antony Blinken urg- 
ing him to let the agreement lapse. 

No STA project has involved sensitive or 
classified research. But even fundamental 
research discoveries can help China gain an 
advantage over the U.S., Gallagher asserted. 
“The evidence available suggests that [China] 
will continue to look for opportunities to ex- 
ploit partnerships organized under the STA 
to advance its military objectives,’ the letter 
argues. “The United States must stop fueling 
its own destruction.” 

The STA holds symbolic significance 
for China because it was the first agree- 
ment the two countries signed after nor- 
malizing relations in 1972, says Huiyao 
Wang, president of the Center for China 
and Globalization, a Beijing-based think 
tank. It was renewed during the most tu- 
multuous times, such as after the 1989 
massacre at Tiananmen Square. Even the 
administration of former President Donald 
Trump, which was no fan of China, agreed 
to several short-term extensions before sign- 
ing the current STA, which added language 
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to protect intellectual property, in 2018. 

Since the Biden administration took of- 
fice in January 2021, it has not launched any 
new government-to-government initiatives 
under the STA. Although ending the bilateral 
agreement wouldn’t prevent government-to- 
government scientific cooperation, a U.S. De- 
partment of State spokesperson says, “each 
agency would have to negotiate arrange- 
ments for each individual cooperation.” 

Gallagher’s letter to Blinken turned what 
had been quiet negotiations between the 
two sides into a front-burner political issue. 
Ending all collaboration because some future 
project might become problematic would 
amount to “throwing the baby out with the 
bathwater,’ says Holdren, who hopes the two 
sides can reach an agreement. Even so, sup- 
porters of a new deal say changes are needed. 

“Td like to see language relating to 
greater transparency and data sharing,” says 
Sudip Parikh, CEO of AAAS (which pub- 
lishes Science), citing the potential value of 
pooling health or environmental data col- 
lected by each country. “Reciprocity has not 
always been a hallmark of China’s scientific 
enterprise.” 

The pact should include “some kind of free 
flow of information requirement,” says Denis 
Simon, an independent specialist on China 
policy. He notes that ambiguous wording in 
recent Chinese data protection laws has left 
foreign scientists unsure that they will have 
access to data generated by joint projects. 

China may be skeptical of any proposed 
changes. “If the U.S. is thinking about im- 
proving and strengthening this agreement,” 
a deal is within reach, Wang says. But if the 
US. wants to limit the pact or add inequitable 
clauses, he warns, “that could be a problem.” 

The issue's politicization is already hinder- 
ing cooperation, says Mu-ming Poo, head of 
the Chinese Academy of Sciences’s Institute 
of Neuroscience. Renewing the STA “is help- 
ful only if there is basic trust and true will- 
ingness to collaborate,’ says Poo, who spent 
decades in the U.S. before returning to China. 

US. officials acknowledge that striking a 
new deal won't be easy. “We are clear-eyed 
about the challenges associated with [China’s] 
national strategies related to S&T, as well as 
its domestic legal framework,” the State De- 
partment spokesperson said. “Strengthened 
protections in the agreement will be essential 
for any longer-term extension.” 

If a deal can be reached, Holdren predicts 
the Biden administration will take advantage 
of it to launch several joint projects. Areas 
ripe for collaboration, he says, include ef- 
forts to prevent future pandemics, improve 
nuclear reactor safety, and better monitor 
earthquake activity. 


With reporting by Dennis Normile. 
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An end to 
horseshoe 
crab bleeds? 


Proposal could allow, 
synthetic proteins 

to replace harvested 
enzyme in drug testing 


By Freda Kreier 


proposal released last week by an 
obscure U.S. pharmaceutical organi- 
zation could help end a decadeslong 
practice of bleeding horseshoe crabs 
caught along the U.S. East Coast for a 
protein used to test drug safety. 

On 22 August, the U.S. Pharmacopoeia 
(USP)—a not-for-profit group that helps set 
industry testing standards—published draft 
guidelines on using synthetic alternatives 
to horseshoe crab blood. The move could 
help reduce the need to harvest blood from 


fi 


hundreds of thousands of crabs each year, a . 


practice that kills up to 30% of the captured 
animals. That has put at risk U.S. populations 
of the 445-million-year-old arthropods, as 
well as the migratory shorebirds that feed on 
horseshoe crab eggs, conservationists say. 

The draft guidelines are a “first and im- 
portant step” toward changing how com- 
panies test for bacterial contamination in 
products, says Jaap Venema, chief science 
officer at USP. 

Conservationists welcomed the proposal. 
The lack of an industry standard for replac- 
ing the horseshoe blood protein has been 
“the biggest impediment to [pharmaceuti- 
cal companies] transitioning,” says ecologist 
David Mizrahi, vice president of research 
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and monitoring at New Jersey Audubon 
and co-founder of the Horseshoe Crab Re- 
covery Coalition. 

The discovery in the 1970s that a horse- 
shoe crab’s blue-tinged blood coagulates in 
the presence of endotoxins—a dangerous 
bacterial byproduct—was a _ breakthrough 
moment for pharmaceuticals. The coagu- 
lating enzyme, limulus amoebocyte lysate 
(LAL), soon became the global standard for 
endotoxin testing, used to check everything 
from pacemakers to chemotherapy drugs and 
COVID-19 vaccines. 

But this lifesaving test comes at a high 
cost. Because horseshoe crabs cannot be 
raised in captivity, companies in North 
America harvest blood from Atlantic coast 
animals captured as they migrate inshore 
each spring to breed. In 2016, an estimated 
400,000 horseshoe crabs were captured for 
their blood. That number has since shot up 
to about 600,000 crabs a year. 

Bleeding the crabs doesn’t necessarily kill 
them—70% to 95% survive and are put back 
in the ocean, according to conservation and 
industry estimates. But some released ani- 
mals later skip breeding or die from blood 
loss and stress, Mizrahi says. Although Atlan- 
tic horseshoe crab numbers are recovering 
from a population crash in the 1990s, they 
remain below historic levels. 


SCIENCE science.org 


Atlantic horseshoe crabs are bled every year for an 
enzyme used to test drug safety. 


Not just horseshoe crabs are at risk, 
Mizrahi says. Their breeding coincides with 
the spring migration of shorebirds, such as 
the endangered red knot, that feed on the 
crabs’ eggs before continuing north. This 
refueling is so critical that some states have 
banned harvesting female horseshoe crabs. 

Conservationists have pushed for drug 
companies to switch to synthetic proteins 
such as recombinant factor C (rFC), which 
has existed since the 1990s. The European 
Pharmacopeia endorsed using rFC in 2019. 

But much of the industry has been slow to 
switch because rFC is not endorsed by USP, 
which sets standards for drug manufacturing 
in 150 countries. Companies currently have to 
prove to regulators that synthetic alternatives 
are as effective as LAL for each product—a 
process that takes time and money. 

USP’s microbiology committee has pre- 
viously rejected proposals to replace LAL. 
But this time a large body of data supports 
using synthetic alternatives, Venema says. 
The standards will be open for public com- 
ment in November and December and 
could be finalized in 2024. 


Freda Kreier is a freelance journalist in Washington, D.C. 


Al excels 
at name 
that smell 


Neural network 
predicts odors from 
chemical structures 


By Elizabeth Pennisi 


hen Jonathan Deutsch agreed to 
sniff 400 vials of unlabeled liq- 
uid for science, he didn’t know he 
would be competing with a com- 
puter. A research chef who helps 
with food product development at 
Drexel University, he simply welcomed the 
chance to hone his sense of smell. But odor 
profiles generated by Deutsch and 13 other 
volunteers served as a test for a computer 
program that had been trained to produce 
these same types of descriptions—such as 
fruity, cooling, fishy, piny—using chemical 
structure alone. 

The results, reported on p. 999, show that 
the program, a so-called graph neural net- 
work, is excellent at imitating human sniff- 
ers, at least when it comes to simple odors. 
It reliably predicted what the volunteers 
smelled, a feat sensory biologists have been 
working toward for decades. It also predicted 
the smells of 500,000 other molecules, with 
no need to make or sniff them. 

The result is a boon for the study of ol- 
faction, a field that has “floundered around 
for years looking for this information,’ says 
Stuart Firestein, a neuroscientist at Co- 
lumbia University who was not involved in 
the work. “The approach offers great po- 
tential” to speed up the search for better . 
smelling consumer products, adds Andreas 
Grasskamp, a neurobiologist who studies 
perception at the Fraunhofer Institute for 
Process Engineering and Packaging. 

The findings may also help establish ol- 
factory research as a field on par with sight 
or vision. Smell, which in humans involves 
a smaller proportion of the brain and fewer 
types of receptor cells than in other mam- 
mals, was long considered “a primitive sense 
that wasn’t worth studying by neurobio- 
logists,’ says Pablo Meyer Rojas, a physicist at 
IBM Research. It has also defied systematic 
study. Whereas what we see and hear reflects 
quantifiable properties such as wavelength 
and frequency, smell doesn’t neatly corre- 
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spond with the shape of a molecule. Similarly 
structured molecules can smell different, 
whereas dissimilar molecules can produce 
the same odor. 

Ten years ago, a crowdsourced competition 
challenged researchers to use artificial intel- 
ligence (AD to predict an odor’s smell from its 
structure. Asked to “smell” 69 chemicals, the 
winning algorithms could pick out eight of 
19 possible odors that people had _ attrib- 
uted to these samples. But it couldn’t group 
the samples according to how similar they 
smelled, says Joel Mainland, a neuroscientist 
at the Monell Chemical Senses Center who 
helped organize the challenge. 

Rick Gerkin, a neuroscientist at the AI 
company Osmo, was one of the winners. 
Because the number of well-characterized 
odor molecules for training AI “sniffers” has 
greatly expanded since the competition, he 
thought he could do better. With Alexander 
Wiltschko, first at Google Research and 
now at Osmo, Gerkin and colleagues in- 
put the structures and odor descriptions of 
5000 molecules into a more sophisticated AI. 
It learned to recognize patterns in the train- 
ing data, correlating a molecule’s smell with 
features of its constituent atoms—their iden- 
tities, sizes, and connecting bonds. 

Then came the testing, led by Mainland. 
Deutsch and his fellow volunteers signed 
onto Zoom sessions with Monell sensory 
biologist Emily Mayhew to sniff the 400 mys- 
tery vials and report which of 55 odors they 
detected in each. Mayhew, now at Michigan 
State University, notes that perceived smells 
vary greatly between people. So the team cal- 
culated average human ratings to compare 
with the network’s predictions. In more than 
half of cases, the neural network got even 
closer to this average than any individual in 
the group did. “This is a difficult accomplish- 
ment,’ Grasskamp says. 

Next the team generated 500,000 hypo- 
thetical chemical structures. The network 
quickly inferred how they should smell, pro- 
viding a database that should help in the 
search for odors for new foods, perfumes, 
cleaners, and other products. 

Although the neural network shows it’s 
possible to map a chemical’s structure to an 
odor, it sheds little light on smell’s underlying 
biology. From a basic research perspective, 
“it’s not clear whether this is an important 
development rather than an incremental 
one,” says Linda Buck, a neuroscientist at the 
Fred Hutchinson Cancer Center. She adds 
that the neural network still hasn’t proved 
it can evaluate mixtures of molecules—the 
kinds of complex odors we encounter in the 
real world. 

“The frontier is now about mixtures,” 
Gerkin agrees. That’s what he hopes to teach 
his algorithms to do next. 


934 1SEPTEMBER 2023 + VOL 381 ISSUE 6661 


ACADEMIC RESEARCH 


Seizure of Nicaraguan university 
deals blow to nation’s scientists 


Move marks President Daniel Ortega’s latest effort to crack 
down on perceived political opponents 


By Kata Karath 


esearchers in Nicaragua say the gov- 
ernment’s takeover of a prominent 
private university has dealt another 
serious blow to academic freedom and 
scientific autonomy in the country. 
On 15 August, President Daniel 
Ortega’s administration confiscated the assets 
of the Central American University (UCA) in 
Managua and closed its campuses, alleging 
the Jesuit-run institution had become a “hub 
of terrorism” and its leaders were “traitors to 
the country.” It then reopened the campus 
with new leadership and a new name: the 
Casimiro Sotelo Montenegro National Uni- 
versity, in honor of a student activist assas- 
sinated in 1967. 

The move was the latest effort by Ortega, 
who has led Nicaragua since 2007, to crack 
down on perceived political 
opponents, including aca- 
demics. In recent years he 
has closed some two dozen 
other, mostly smaller, pri- 
vate universities and the 
nation’s National Academy 
of Sciences. 

UCA officials have re- 
jected the government’s 
allegations. But faculty 
members fear the move 
eliminates one of the na- 
tion’s last relatively independent centers of 
academic research and will result in more 
researchers and students leaving Nicaragua. 

“The [university] was the last bastion of 
quality higher education,” says one member 
of its faculty, who requested anonymity be- 
cause of fears of retaliation. “I believe that 
independent scientific investigations are over 
in Nicaragua.” 

Considered one of Central America’s top 
private universities, the 63-year-old UCA has 
about 500 faculty members and 6000 stu- 
dents, mostly undergraduates. (Ortega was 
briefly a student there in the 1960s.) It houses 
a range of research efforts, including a molec- 
ular biology center, a natural sciences insti- 
tute, and an herbarium holding some 80,000 
plant samples. UCA also provided space and 
support to the science academy, which con- 


“The trend is 
becoming really 
totalitarian. | feel like 
we're living ina 
dark medieval village.” 


Faculty member, 
Central American University 


ducted research and sponsored courses for 
teachers and journalists, as well as debates on 
issues including the proposed construction of 
an interoceanic canal through Nicaragua. 

The university has faced increasing scru- 
tiny from Ortega’s government. In 2018, 
after UCA became a hub of student-led pro- 
democracy demonstrations, Nicaragua’s Na- 
tional Council of Universities did not renew 
its operating license. Since then, “we have 
been operating, you could say, illegally,’ says a 
university official who requested anonymity. 

The government also tightly regulated fac- 
ulty research, especially in the social sciences. 
“If I was going to do an investigation [off 
campus] ... I had to inform the local repre- 
sentative of the ruling party, and they had to 
accompany [me] to each household,” a UCA 
professor says. Researchers were barred from 
accessing public records such as national sta- 
tistics and faced legal action 
if they failed to report for- 
eign funding for their work. 
As a result, many research- 
ers stopped publishing and 
UCA stopped hosting inter- 
national meetings. 

The university once re- 
ceived nearly half of its 
$16 million annual bud- 
get from the government, 
but that funding ended in 
2018. To cope, the university 
downsized staff and increased fees. “I think 
they wanted to make it so difficult that we 


id 


would close down on our own, but we didn’t,” . 


the university official says. But, “In the end, 
[the government] ... had to kick us out.” 

It’s not clear how many faculty will now 
return. Many researchers have been exiled or 
left the country under mounting threats from 
the government. Some will stay “only because 
they don’t have any other options,” the official 
says, adding: They will be “forced to say that 
everything is wonderful.” 

“The trend is becoming really totalitarian,” 
a faculty member says. “I feel like we're living 
in a dark medieval village ... where there is 
no rule of law, no social contract, where no 
one can say anything.” 


Kata Karath is a journalist in Ecuador. 


science.org SCIENCE 


PHOTO: SVETLANA SULTANAEVA/ISTOCK 


BIOLOGY 


— 


Why do cats love tuna so much? 


Study identifies taste receptors that give our feline friends a 
craving for meat—and one fish in particular 


By David Grimm 


part from Garfield’s legendary love 

of lasagna, perhaps no food is more 

associated with cats than tuna. The 

dish is a staple of everything from 

The New Yorker cartoons to Meow 

Mix jingles—and more than 6% of 
all wild-caught fish goes into cat food. Yet 
tuna is an odd favorite for an animal that 
evolved in the desert. Now, researchers say 
they have found a biological explanation for 
this curious craving. 

In a study published last month in Chemi- 
cal Senses, scientists report that cat taste 
buds contain the receptors needed to detect 
umami—the savory, deep flavor of various 
meats, and one of the five basic tastes in ad- 
dition to sweet, sour, salty, and bitter. Indeed, 
umami appears to be the primary flavor cats 
seek out. That’s no surprise for an obligate 
carnivore. But these cat receptors also seem 
to be uniquely tuned to molecules found at 
high concentrations in tuna, perhaps reveal- 
ing why our feline friends prefer this deli- 
cacy over all others. 

“This is an important study that will help 
us better understand the preferences of our 
familiar pets,’ says Yasuka Toda, a molecu- 
lar biologist at Meiji University and a leader 
in studying the evolution of umami taste in 
mammals and birds. The work could help 
pet food companies develop healthier diets 
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and more palatable medications for cats, 
says Toda, who was not involved with the 
industry-funded research. 

Cats have a unique palate. They can’t taste 
sugar because they lack a key protein for 
sensing it. That’s probably because there’s 
no sugar in meat, says Scott McGrane, a fla- 
vor scientist and research manager for the 
sensory science team at the Waltham Pet- 
care Science Institute, which is owned by pet 
food-maker Mars Petcare UK. Cats also have 
fewer bitter taste receptors than humans 
do—a common trait in uber-carnivores. 

But cats must taste something, McGrane 
reasoned, and that something is likely 
the savory flavor of meat. In humans and 
many other animals, two genes—TasIr1 
and TasiIr3—encode proteins that join to- 
gether in taste buds to form a receptor that 
detects umami. Previous work had shown 
that cats express the Jasir3 gene in their 
taste buds. But do they have the other criti- 
cal puzzle piece? 

McGrane and colleagues biopsied the 
tongue of a 6-year-old male cat, eutha- 
nized for health reasons. His taste buds 
expressed both the Tasir] and TasIr3 
genes—the first time scientists showed 
that cats have all the molecular machinery 
needed to detect umami. 

When the researchers compared the pro- 
tein sequences encoded by these genes with 
those of humans, however, they found a 


striking difference: Two critical sites the ied 
low the human receptor to bind to glutanne 
and aspartic acid—the main amino acids 
that activate umami taste in people—were 
mutated in cats. “So I began thinking, maybe 
cats can’t taste umami,” McGrane says. 

To double check, he and his team engi- 
neered cells to produce the cat umami re- 
ceptor on their surface. They then exposed 
the cells to a variety of amino acids and nu- 
cleotides. The cells did respond to umami— 
but with a twist. In people, the amino acids 
bind first and the nucleotides amplify the 
response. But in cats, the opposite was true. 

In the last part of the experiment, 
McGrane and colleagues gave 25 cats a taste 
test. In a series of trials, they presented the 
felines with two bowls of water, each with 
various combinations of amino acids and 
nucleotides, or just water alone. The ani- . 
mals showed a strong preference for bowls 
that contained molecules found in umami- 
rich foods, suggesting this flavor—above all 
others—is the primary motivator for cats. 

“T think umami is as important for cats as 
sweet is for humans,” Toda says. Dogs, she 
notes, taste both sweet and umami, which 
may explain why they’re not such fussy eaters. 

But it wasn’t just umami in general the 
cats craved. They were especially keen for 
bowls containing histidine and inosine 
monophosphate—compounds found at 
particularly high levels in tuna. “It was 
one of the most preferred combinations,” 
McGrane says. “It really seems to hit that 
umami sweet spot.” 

That jibes with Toda’s past experience. As 
a veterinary student, she got cats with no 
appetite to eat by sprinkling their food with 
dried flakes of bonito—a common umami 
ingredient in Japan and a close relative of 
tuna. “It worked very well!” she says. 

Indeed, one application of the work could 
be developing foods that are more palat- 
able to cats, McGrane says. He also thinks " 
a spoonful of umami (figuratively speak- 
ing) could help feline medications go down 
easier—welcome news for anyone who’s al- . 
most lost a finger trying to pill a cat. 

Why cats have a hankering for tuna in 
the first place remains a mystery. It may 
have developed over time. They’re depicted 
eating fish in the art of Ancient Egypt, and 
by the Middle Ages, felines in some Middle 
Eastern ports were consuming large quan- 
tities of fish—including tuna—likely left by 
fishers. Cats that evolved the taste may have 
had an advantage over their comrades, says 
Fiona Marshall, a zooarchaeologist at Wash- 
ington University in St. Louis. 

“We're at a starting point—it’s not a fin- 
ished story,’ McGrane admits. “But all of 
this work is building up to our basic under- 
standing of what it means to be a cat.” & 
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ne morning in June, barely 
5 months after the first dispensary 
for recreational cannabis opened 
in New York state, neuroscientist 
Yasmin Hurd spoke via Zoom to an 
audience of educators and special- 
ists who work with or run programs 
for children. The session’s organiz- 
ers, alarmed by how many children 
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As states relax their laws on 


cannabis, neuroscientist 


Yasmin Hurd is warning about 


By Ingrid Wickelgren 


in their South Bronx community were now 
getting their hands on cannabis, had sought 
Hurd’s expertise on the drug’s effects. 

Hurd put up a slide of the human brain, 
its bumps and grooves tinged blue, green, 
yellow, and red to indicate the distribution of 
the receptors to which tetrahydrocannabinol 


the drug’s dangers 
for the developing brain 


(THC), the psychoactive ingredient in can- 
nabis, binds. She showed how they exist 
throughout the brain—in the folds of the ce- 
rebral cortex, where much of cognition lies; 
the cauliflower-shaped cerebellum, the seat 
of motor coordination; the hippocampus, 
Grand Central for memory; and the amyg- 
dala, a crucial hub for emotional regulation. 

The receptors, said Hurd, who heads an 
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addiction research lab at the Icahn School 
of Medicine at Mount Sinai, are “really criti- 
cal for so many processes in the brain.” And 
when a person uses cannabis—in any of its 
edible, dabbable, smokable forms—the drug 
overwhelms them and disrupts their ability 
to calibrate neuronal activity. 

That, in turn, can be profoundly prob- 
lematic for the developing brain, Hurd’s re- 
search suggests. She sees growing evidence 
in the field that cannabis use puts children 
and adolescents at risk for a variety of psy- 
chiatric problems, from dependence on that 
drug and others to schizophrenia. In utero 
exposure, she believes, can ignite mental 
health problems in childhood and beyond. 
In studies with rats, human fetal 
tissue, and children, her lab has 
begun to uncover changes in gene 
expression, as well as alterations in 
the brain’s chemical communica- 
tion systems and wiring, that may 
underlie some of these effects. 

Hurd’s work is especially com- 
pelling because she has been able 
to link results across species, col- 
leagues say. “It’s so hard to be able 
to go back and forth between ani- 
mal models and effects in humans,” 
says Susan Tapert, an addiction re- 
searcher at the University of Califor- 
nia, San Diego. “She’s really one of 1b 
the leaders in the field in being able 
to pull those very different kinds of 
studies together.” Tapert agrees the 
evidence for harmful effects on the 
developing brain are concerning, 
although she says the harm likely 
varies widely from one individual 
to another. The risks for adults are 
lower, she says, as the drug’s influ- 
ences on memory, mood, sleep, and 
motivation tend to wane within 5 
about a month of discontinued use. 

Research on cannabis’ devel- 
opmental effects has grown in re- 
cent years, but Hurd “definitely 
pioneered this field,” says Miriam 
Melis, a neuroscientist at the Uni- 
versity of Cagliari. Hurd also had to 
prove herself as a Black woman in 
a discipline then dominated by white men, 
Melis says. “For me she’s an inspiration of a 
woman in science.” 

Her work has become increasingly rel- 
evant, as U.S. states—23 so far, plus Wash- 
ington, D.C.—legalize cannabis for adult 
recreational use. Hurd’s findings raise “big 
red caution flags,” says Eric Nestler, an addic- 
tion researcher and director of the Friedman 
Brain Institute, which Hurd’s lab is a part of. 
Legalization is “not a free decision. It is a de- 
cision that, according Yasmin’s data—and I 
would agree with her data—will bring costs.” 
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Some adults might be able to use canna- 
bis quite safely, experts say, yet the legaliza- 
tion trend has made the drug increasingly 
accessible to pregnant women and also 
children, who may ask adults to buy it, 
take it from parents, or use fake IDs to get 
it. In the Bronx, kids as young as age 11 or 
12 know which shops will sell to minors, ac- 
cording to Davon Russell, president of the 
Women’s Housing and Economic Develop- 
ment Corporation, a Bronx community de- 
velopment organization, who invited Hurd 
to speak. At the same time, the potency of 
the products on offer has risen sharply (see 
graphic, below) and is only loosely regulated 
by states. Customers “have no clue about the 
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substances they are consuming,” Hurd says. 
She has seen the consequences firsthand: 
Besides running her lab, she directs Mount 
Sinai’s Addiction Institute, overseeing in- 
patient and outpatient treatment centers, as 
well as programs for kids. 

Although Hurd opposes the criminaliza- 
tion of cannabis use and possession, she 
believes legalization has come with under- 
appreciated downsides. She’s concerned it 
has fanned a permissive culture and a per- 
ception that the drug is generally safe. “I am 
worried about how cavalier we’re becoming 


and that there is a cannabis smoke shop now 
practically, in some places, on every other 
block,’ she says. “I feel frustrated that people 
are willing to sacrifice kids and young people 
for their quote-unquote right to get high.” 
Her science, she hopes, will foster a greater 
awareness of the potential harms. 


HURD REMEMBERS HER FIRST science experi- 
ment. At about age 6, she set up tin pans 
with rice outside her home in Jamaica, 
varying the amounts of water and shade to 
see how using sunlight to cook rice under 
differing conditions affected its quality. 
She was an inquisitive child who liked to 
question the rules. Why, she recalls asking 
her parents, did she have to drink 
milk? Did they know that humans 
are the only species that drinks 
other species’ milk? 

In the early 1970s, when Hurd 
was about 10, her parents divorced 
and she moved to New York City 
with her mother and siblings. She 
loved it from day one. “I’m defi- 
nitely not a stereotypical Jamaican, 
in that some Jamaicans are just 
like, ‘No problem man, tomorrow, 
tomorrow. I came to New York, 
and everyone was moving, moving, 
moving,” she says, and thought: 
“This is my place!” 

At South Shore High School in 
Brooklyn, she was the only Black 
person in her honors classes. “You 
are a Black girl and they are always 
challenging you that you really know 
anything,” she says. Yet she excelled 
in her science classes and studied 
German so she could read classic ex- 
periments in their original language. 
Hurd says her family valued educa- 
tion, and their high expectations 
helped propel her to college. 

At Binghamton University, she 
convinced administrators to cre- 
ate a new degree for her: a B.A. in 
biochemistry and behavior. It was 
meant to blend her two main inter- 
ests, chemistry and behavior—but 
she now jokes that it was essentially 
a neuroscience major before that became a 
thing. In graduate school at the Karolinska 
Institute in Stockholm, her colleagues gave 
her another lesson in expectations—this 
one not based on her skin color. “They said, 
‘You're American; therefore, you must be the 
best;” Hurd says. Their high expectations 
motivated her to measure up. 

In grad school, Hurd helped develop tech- 
niques for measuring neurotransmitters in 
the rat brain. In some of her experiments, 
she used amphetamine or cocaine to artifi- 
cially raise dopamine levels. She was fasci- 


2020 


1 SEPTEMBER 2023 + VOL 381 ISSUE 6661 937 


Neuroscientist Yasmin Hurd (bottom right) examines a slide of donated human brain tissue with students in her lab at the Icahn School of Medicine at Mount Sinai. 


nated to see a mild-mannered rat suddenly 
become hyperactive, and at higher doses, 
aggressive and ready to pounce. “If you’ve 
never seen a paranoid rat,” she says, “it was 
just ferocious.” 

As a postdoc at the National Institute 
of Mental Health in the early 1990s, she 
learned some of the then-new molecular 
biology tools to study how cocaine affected 
cells and receptors in rodent brains. But 
she wasn’t satisfied. “I needed it to have a 
human relevance,” Hurd recalls. 

After finding a National Institutes of 
Health pathologist who had started a brain 
bank that included cocaine users, she set 
out to measure messenger 
RNA (mRNA) | transcripts 
lingering in the tissue af- 
ter death, hoping to gauge 
gene expression in the users’ 
brains. Other scientists told 
her she was on a fool’s errand 
because, they said, mRNA be- 
comes unstable after death. 
But she proved them wrong, 
and was able to identify mo- 
lecular changes in humans 
that matched findings in rats 
exposed to cocaine, as well 
as some key species differ- 
ences in reward regions of 
the brain. 

When Hurd returned to 
Karolinska as an_ assistant 
professor in the early 1990s, 
she set up her own brain bank, 
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consisting primarily of users of amphetamine 
and heroin. “She was really a pioneer in us- 
ing human brain tissue to understand the 
neurobiology of drug addiction,’ Nestler 
says. Her brain collection, which she ex- 
panded at Mount Sinai, “provided assur- 
ance that mechanisms scientists study in the 
lab, say in rodent models, really focused on 
things that had human relevance.” 

It was rodents alone, however, that en- 
abled Hurd to make her first mark on the 
broader debate about cannabis. Epidemio- 
logical studies had suggested people who 
use cannabis early in life are more likely 
to later become addicted to drugs such as 


The cannabinoid type 1 receptor, a target for the psychoactive ingredient in cannabis, 
is found throughout the adult human brain (warm colors represent higher concentrations). 


cocaine and heroin, inspiring the so-called 
gateway hypothesis. Many scientists and 
laypeople believed the effect was strictly 
environmental—that is, using cannabis is 
likely to introduce people to a drug-using 
crowd and to a dealer who also hawks 
harder drugs. But Hurd thought there 
might also be a biological connection. 
What if, she thought, cannabis changes the 
developing brain in a way that made some 
people vulnerable to addictive substances 
more generally? 

To investigate, Hurd’s team exposed ado- 
lescent rats to THC and found the rodents 
later self-administered heroin at increasing 
rates, reaching dosages far 
higher than controls. Early 
THC exposure also altered 
gene expression in a reward 
center of the brain, they found, 
suggesting the drug can al- 
ter the brain’s endogenous 
opioid system, which is in- 
volved in the perception of 
reward, stress, and pain. Re- 
viewers were skeptical, Hurd 
says, but the manuscript fi- 
nally came out in 2007 in 
Neuropsychopharmacology, 
the year after she became a 
professor at Mount Sinai. 

It was some of the first 
strong biological support 
for the gateway hypothesis, 
Nestler says, helping con- 
vince researchers, educators, 
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and policymakers that biology was part of 
the picture. 


SPEAKING TO THE AUDIENCE of educators in 
June, Hurd painted cannabis dependence as 
a biological condition. “Many people think, 
‘Oh you can’t become addicted to cannabis,” 
she says, “but when you look at the numbers 
out there, cannabis use disorder is actually 
quite common.” Estimates vary widely, but 
up to 30% of users become unable to stop 
using the drug despite negative effects on 
their health and well-being, according to the 
National Institute on Drug Abuse. 

Adolescents are especially vulnerable, 
Hurd told the Zoom audience, because the 
endocannabinoid system—a network of 
natural signaling molecules structurally 
similar to THC, along with their receptors— 
plays a central role in brain development. It 
fine-tunes the maturation of the prefrontal 
cortex, a brain area involved in self- 
control and decision-making. In 2019, 
Hurd and her colleagues reported that re- 
peated THC exposure during adolescence 
in rats changed the shape and function of 
neurons in the animals’ prefrontal cortex. 
In her presentation, Hurd showed a neon 
green-and-yellow neuron with sparse, 
stunted branches next to its much bushier 
normal counterpart. The simpler struc- 
ture, Hurd explained, means fewer con- 
tacts with other neurons. 

Her study also revealed a pattern of 
gene expression in the rats’ THC-exposed 
neurons that overlapped significantly with 
gene expression profiles seen in people with 
schizophrenia. It was a hint that early can- 
nabis use might sometimes pave the way to 
this psychiatric disorder, as epidemiological 
studies have suggested. Human studies 
also support concerns that early use might 
have lasting effects. A longitudinal study 
of 799 European adolescents published in 
2021 linked cannabis use with a thinning 
of the prefrontal cortex in regions where 
cannabinoid receptors are expressed, and 
with higher levels of impulsiveness. 

Yet much is not known about adolescent 
risk, Tapert says. She adds that a longitu- 
dinal look at nearly 12,000 U.S. children 
called the Adolescent Brain Cognitive De- 
velopment Study, launched in 2016, should 
provide critical data on how cannabis use 
interacts with characteristics such as a 
person’s genetics, history of trauma, stress, 
and family mental health history. 

However such factors shift the balance, 
the increasing potency of cannabis likely 
adds to the risks. In a 2022 paper published 
in Molecular Psychiatry, Hurd along with 
Jacqueline-Marie Ferland, a neuroscientist 
in her lab, reported that exposure to 
high-dose THC (equivalent to a strong 
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recreational human dose of about 20 milli- 
grams, or four typical gummies), but not 
low-dose (equivalent to one 5-milligram 
edible), once every 3 days made rats un- 
usually sensitive to environmental stress- 
ors such as isolation. After such stress, the 
rats tended to avoid other animals—a sign 
of social anxiety—and to consume more 
sugar than controls, indicating increased 
sensitivity to reward. Earlier this year, 
Ferland, Hurd, and their colleagues re- 
ported in JAMA Psychiatry that high-dose 
THC also caused rats to make risky deci- 
sions in a “rat gambling task,’ in which 
a rat must choose between risky and safe 
strategies for winning sugar pellets, be- 
havior similar to that seen in human study 
participants with cannabis use disorder 
gambling for money. 

High- and low-dose THC also have dis- 
tinct effects on the rat brain, they showed. 
High doses altered the shape of neuronal 
support cells called astrocytes and caused 
changes in gene expression that suggest 
disruptions to signaling by the inhibitory 
neurotransmitter GABA. Low-dose con- 
sumption, on the other hand, primarily 
distorted the shape and gene-expression 
patterns of neurons and spurred changes 
in the opioid system. It’s not yet known 
whether such changes also happen in the 
human brain, but Hurd thinks the rat 
studies hint at worrisome biological con- 
nections between cannabis use and neuro- 
transmitter systems involved in a wide 
range of behaviors. 


IN A 2019 STUDY, 7% of girls and women ages 
12 to 44 reported using cannabis while preg- 
nant. That proportion doubled between 2002 
and 2017, the researchers found, and it may 
be even higher today. The fetus, inevitably, is 
exposed: THC readily passes through the pla- 
centa to the fetal brain. 

To look for possible effects, Hurd began 
a collaboration in the early 2000s with 
Diana Dow-Edwards, a neuropharmaco- 
logist at the SUNY Downstate Health 
Sciences University. At the time, Dow- 
Edwards had access to fetal tissue from 
women with a history of drug use who 
had chosen to have an abortion. In women 
who had smoked cannabis, Hurd and 
Dow-Edwards found alterations to the 
fetal brain’s dopamine system, including 
reduced expression of dopamine receptors 
in the amygdala and nucleus accumbens, 
a reward center. The finding hints that in 
utero cannabis exposure could interfere 
with emotional regulation and boost vul- 
nerability to addiction. 

It’s just part of the havoc the two re- 
searchers uncovered in the fetal brain. THC 
reshuffled gene expression in its natural 


How a component 
of cannabis 
could fight addiction 


asmin Hurd has spent much 

of her career documenting the 

harms caused by the psycho- 

active compound in cannabis, 

tetrahydrocannabinol (THC). 
Ironically, she believes another can- 
nabis ingredient, cannabidiol (CBD), 
could help break cannabis depen- 
dence. Her initial focus, though, is on 
testing it to help heroin users. 

Ina seminal study published in 
2009, she showed CBD could reduce 
drug-seeking behavior in rats previ- 
ously exposed to heroin, perhaps by 
reducing craving triggered by cues 
they had associated with the drug. 
“CBD could actually do the opposite 
of THC,” says Hurd, who heads an 
addiction research lab at the Icahn 
School of Medicine at Mount Sinai. 
In 2019, she and her clinical team re- 


ported that compared with a placebo, 


a CBD capsule taken once a day for 
3 days reduced drug cravings and 
anxiety in 45 human heroin users. 
Hurd’s ballooning clinical crew is 
gearing up for larger trials. First up is 
a trial of CBD in 200 people with opi- 
oid use disorder, followed by larger 
scale trials with long-term follow-up 
that will not only look at substance 
abuse and anxiety, but also general 
life outcomes such as employment, 
parenting, and avoiding trouble with 
the law. Mount Sinai neuroscientist 
and former social worker Keren 
Bachi, who helps direct this effort, is 
impressed by Hurd's ability to think 
like a clinician. “She has the vision of 
designing the studies in a way of see- 
ing whether this intervention would 
make a meaningful difference in the 
lives of people,” Bachi says. —I.W. 


1 SEPTEMBER 2023 * VOL 381 ISSUE 6661 


939 


NEWS FEATURES 


= 


The opening of the first recreational cannabis dispensary in New York City in December 2022 drew long lines. 


Inside, a worker organized jars of cannabis flower. 


opioid system. It also tampered with the 
cytoskeleton, or internal scaffold, of develop- 
ing neurons, reshaping their long extensions 
and thereby altering the neuronal wiring in 
parts of the fetal cerebral cortex. 

When Hurd’s group tried to re-create 
the effects of maternal cannabis use in ro- 
dents, they saw behavioral consequences. 
Male rats exposed to THC in the womb 
more readily self-administered heroin as 
adults than controls. And in work pub- 
lished just last year in Biological Psychia- 
try, rats exposed to THC in utero showed 
low motivation, depressionlike traits, and 
increased sensitivity to stress as adults. 

Soon after she arrived at Mount Si- 
nai, Hurd got the opportunity to find out 
whether something similar happens in chil- 
dren exposed to cannabis in utero. A young 
assistant professor there, Yoko Nomura, 
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had started an ambitious longitudinal study 
of pregnant women to examine how various 
aspects of the prenatal environment such as 
stress, toxins, and maternal obesity affect 
children. Nomura kept running into Hurd 
at meetings and was immediately drawn 
to her affable personality. “She is very ap- 
proachable,” says Nomura, now a professor of 
psychology at Queens College at the City Uni- 
versity of New York. The two joined forces. 
For 15 years, Nomura, Hurd, and their col- 
leagues followed the women and their 724 
children, evaluating them annually. They 
also sequenced mRNA in their placentas to 
monitor the activity of thousands of genes. In 
a paper published in 2021 in the Proceedings 
of the National Academy of Sciences, Hurd, 
Nomura, and their colleagues reported that 
mothers’ cannabis use was associated with 
hyperactivity and heightened anxiety and 


aggression among their children at ages 3 
to 6, along with increased cortisol, a stress 
hormone, in hair samples. It also reduced 
expression of placental genes involved in im- 
mune function, which the endocannabinoid 
system helps regulate. These changes corre- 
lated with the children’s future anxiety and 
hyperactivity levels. 

Superstorm Sandy, which hit the New York 
metropolitan area in 2012, enabled Hurd and 
Nomura to show that stress exacerbates these 
prenatal effects. Women who were pregnant 
during the storm used cannabis at high rates, 
likely as a coping mechanism, and in the 
years since, their children have shown signs 
of trouble. At ages 2 to 5, Hurd and Nomura 
reported in May, these children were 31 times 
more likely to meet the criteria for disrup- 
tive behavior disorders and seven times 
more likely to have an anxiety disorder than 
kids exposed to neither cannabis nor Sandy 
in utero. “It’s a drastic synergistic increase,” 
Nomura says. 

That work suggests the effects of can- 
nabis on children may also be amplified 
in communities and families with “much 
greater psychosocial challenges,” Hurd 
says. These may include neighborhoods of 
color beset by poverty, violence, and a dis- 
proportionate number of arrests, she says. 


ALARMED AS SHE IS about these many risks, 
Hurd does not support rolling back legal- 
ization. Criminalizing cannabis possession, 
she notes, has exacted disproportionate 
costs to communities of color. It also hurts 
people grappling with drug addiction. “It’s 
absurd in a civilized society that we think 
that locking people up for substance use 
will cure the problem. It actually wors- 
ens the problem,” she says. Instead, she 
favors regulations that limit potency and 
using tax revenues from the sale of canna- 
bis to educate people about the risks, and 
for treatment and research to help those 
harmed by its use. 

Hurd spends much of her time fighting for 
space for Mount Sinai’s addiction treatment 
centers. It’s a constant, depressing battle 
against the stigma of substance use disor- 
ders, she says, despite an overdose epidemic 
that has gripped the nation. A big part of the 
problem is money. “Addictions are not a clin- 
ically profitable specialty,’ she says. 

But she’s determined to keep fight- 
ing. “There are a lot of people who have 
a substance use disorder who would give 
everything to get back a normal life. Ev- 
erything,” Hurd says. “That’s why I’m so 
committed to this career. It’s to help give 
people their lives back.” ™ 


Ingrid Wickelgren is a freelance science journalist 
based in New Jersey. 
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and policymakers that biology was part of 
the picture. 


SPEAKING TO THE AUDIENCE of educators in 
June, Hurd painted cannabis dependence as 
a biological condition. “Many people think, 
‘Oh you can’t become addicted to cannabis,” 
she says, “but when you look at the numbers 
out there, cannabis use disorder is actually 
quite common.” Estimates vary widely, but 
up to 30% of users become unable to stop 
using the drug despite negative effects on 
their health and well-being, according to the 
National Institute on Drug Abuse. 

Adolescents are especially vulnerable, 
Hurd told the Zoom audience, because the 
endocannabinoid system—a network of 
natural signaling molecules structurally 
similar to THC, along with their receptors— 
plays a central role in brain development. It 
fine-tunes the maturation of the prefrontal 
cortex, a brain area involved in self- 
control and decision-making. In 2019, 
Hurd and her colleagues reported that re- 
peated THC exposure during adolescence 
in rats changed the shape and function of 
neurons in the animals’ prefrontal cortex. 
In her presentation, Hurd showed a neon 
green-and-yellow neuron with sparse, 
stunted branches next to its much bushier 
normal counterpart. The simpler struc- 
ture, Hurd explained, means fewer con- 
tacts with other neurons. 

Her study also revealed a pattern of 
gene expression in the rats’ THC-exposed 
neurons that overlapped significantly with 
gene expression profiles seen in people with 
schizophrenia. It was a hint that early can- 
nabis use might sometimes pave the way to 
this psychiatric disorder, as epidemiological 
studies have suggested. Human studies 
also support concerns that early use might 
have lasting effects. A longitudinal study 
of 799 European adolescents published in 
2021 linked cannabis use with a thinning 
of the prefrontal cortex in regions where 
cannabinoid receptors are expressed, and 
with higher levels of impulsiveness. 

Yet much is not known about adolescent 
risk, Tapert says. She adds that a longitu- 
dinal look at nearly 12,000 U.S. children 
called the Adolescent Brain Cognitive De- 
velopment Study, launched in 2016, should 
provide critical data on how cannabis use 
interacts with characteristics such as a 
person’s genetics, history of trauma, stress, 
and family mental health history. 

However such factors shift the balance, 
the increasing potency of cannabis likely 
adds to the risks. In a 2022 paper published 
in Molecular Psychiatry, Hurd along with 
Jacqueline-Marie Ferland, a neuroscientist 
in her lab, reported that exposure to 
high-dose THC (equivalent to a strong 
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recreational human dose of about 20 milli- 
grams, or four typical gummies), but not 
low-dose (equivalent to one 5-milligram 
edible), once every 3 days made rats un- 
usually sensitive to environmental stress- 
ors such as isolation. After such stress, the 
rats tended to avoid other animals—a sign 
of social anxiety—and to consume more 
sugar than controls, indicating increased 
sensitivity to reward. Earlier this year, 
Ferland, Hurd, and their colleagues re- 
ported in JAMA Psychiatry that high-dose 
THC also caused rats to make risky deci- 
sions in a “rat gambling task,’ in which 
a rat must choose between risky and safe 
strategies for winning sugar pellets, be- 
havior similar to that seen in human study 
participants with cannabis use disorder 
gambling for money. 

High- and low-dose THC also have dis- 
tinct effects on the rat brain, they showed. 
High doses altered the shape of neuronal 
support cells called astrocytes and caused 
changes in gene expression that suggest 
disruptions to signaling by the inhibitory 
neurotransmitter GABA. Low-dose con- 
sumption, on the other hand, primarily 
distorted the shape and gene-expression 
patterns of neurons and spurred changes 
in the opioid system. It’s not yet known 
whether such changes also happen in the 
human brain, but Hurd thinks the rat 
studies hint at worrisome biological con- 
nections between cannabis use and neuro- 
transmitter systems involved in a wide 
range of behaviors. 


IN A 2019 STUDY, 7% of girls and women ages 
12 to 44 reported using cannabis while preg- 
nant. That proportion doubled between 2002 
and 2017, the researchers found, and it may 
be even higher today. The fetus, inevitably, is 
exposed: THC readily passes through the pla- 
centa to the fetal brain. 

To look for possible effects, Hurd began 
a collaboration in the early 2000s with 
Diana Dow-Edwards, a neuropharmaco- 
logist at the SUNY Downstate Health 
Sciences University. At the time, Dow- 
Edwards had access to fetal tissue from 
women with a history of drug use who 
had chosen to have an abortion. In women 
who had smoked cannabis, Hurd and 
Dow-Edwards found alterations to the 
fetal brain’s dopamine system, including 
reduced expression of dopamine receptors 
in the amygdala and nucleus accumbens, 
a reward center. The finding hints that in 
utero cannabis exposure could interfere 
with emotional regulation and boost vul- 
nerability to addiction. 

It’s just part of the havoc the two re- 
searchers uncovered in the fetal brain. THC 
reshuffled gene expression in its natural 
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tetrahydrocannabinol (THC). 
Ironically, she believes another can- 
nabis ingredient, cannabidiol (CBD), 
could help break cannabis depen- 
dence. Her initial focus, though, is on 
testing it to help heroin users. 

Ina seminal study published in 
2009, she showed CBD could reduce 
drug-seeking behavior in rats previ- 
ously exposed to heroin, perhaps by 
reducing craving triggered by cues 
they had associated with the drug. 
“CBD could actually do the opposite 
of THC,” says Hurd, who heads an 
addiction research lab at the Icahn 
School of Medicine at Mount Sinai. 
In 2019, she and her clinical team re- 
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ported that compared with a placebo, 


a CBD capsule taken once a day for 
3 days reduced drug cravings and 
anxiety in 45 human heroin users. 
Hurd’s ballooning clinical crew is 
gearing up for larger trials. First up is 
a trial of CBD in 200 people with opi- 
oid use disorder, followed by larger 
scale trials with long-term follow-up 
that will not only look at substance 
abuse and anxiety, but also general 
life outcomes such as employment, 
parenting, and avoiding trouble with 
the law. Mount Sinai neuroscientist 
and former social worker Keren 
Bachi, who helps direct this effort, is 
impressed by Hurd's ability to think 
like a clinician. “She has the vision of 
designing the studies in a way of see- 
ing whether this intervention would 
make a meaningful difference in the 
lives of people,” Bachi says. —I.W. 
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EVOLUTIONARY BIOLOGY 


The raw material of evolution 


Estimates of whale mutation rates contribute 


to understanding evolutionary processes 


By A. Rus Hoelzel and Michael Lynch? 


volution happens when the code of 
life, DNA, is changed by the process 
of mutation. Mutations include dele- 
tions and insertions, rearrangements, 
and transpositions (moving DNA). 
However, it is the rate of point muta- 
tions, which affect a single site in the chain 
of nucleotides that make up DNA, that is 
most often considered. Knowing the rate 
and pattern of mutation is essential to 
understanding the process of evolution. 
This knowledge increases understanding 
of natural selection and has applications 
such as tracking demography and dating 
phylogenies, but calculating the muta- 
tion rate is not straightforward. On page 
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990 of this issue, Suarez-Menéndez et al. 
(1) use parent-offspring trios and genome 
sequences to estimate rates of point-muta- 
tional change from one generation to the 
next in four species of baleen whales in 
the North Atlantic: humpback (Megaptera 
novaeangliae), blue (Balaenoptera muscu- 
lus), fin (Balaenoptera physalus), and bow- 
head (Balaena mysticetus). 

Generating sufficient data from such in- 
accessible animals is a challenge. However, 
Suarez-Menéndez et al. sampled exten- 
sively enough to identify parent-offspring 
trios—five trios for the humpback whale 
and one for each of the other three spe- 
cies—using a relatively inexpensive and 
rapid genetic method for kinship screen- 
ing. Their results matter because a muta- 
tion rate cannot simply be calculated for 
one species, for which this can be easily 
done, and then applied more generally to 
other taxa. There is no universal mutation 
rate. There is not even a single mutation 


rate for a given taxon—it varies across indi- 
viduals and among nuclear, mitochondrial, 
and chloroplast genomes. The mutation 
rate is subject to natural selection like all 
traits (2, 3), and mutation rates in nuclear 
genomes vary 10,000-fold across the tree 
of life and 40-fold among vertebrates (3). 
Estimating mutation rates in very large 
mammals is of interest in part to help ad- 
dress how species with long generation 
times (average time between two consecu- 
tive generations) avoid high incidence of 
somatic diseases such as cancer. Suarez- 
Menéndez et al. estimated the nuclear point- 
mutation rate for the four species of baleen 
whales combined to be 1.11 x 10-°, which is 
high compared with earlier estimates that 
were based on phylogenies. Therefore, in 
cetaceans a lower than expected incidence 
of cancer may be due to selection, as sug- 
gested by Tejada-Martinez et al. (4) and oth- 
ers, rather than a slower mutation rate. 
After a mutation occurs, the individual 
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with the mutation may fail to survive or 
reproduce because of impairment due to 
the mutation (purifying selection) or sim- 
ply by failing to reproduce by chance (ge- 
netic drift). Therefore, a comparison of 
populations or species after 100 or 1000 
generations will reflect net genetic differ- 
entiation that results from de novo varia- 
tion introduced by mutation, as estimated 
by trio analysis, which is then modified by 
subsequent forces of selection and drift. 
Eventually, the measured rate will reflect 
new variants shared by all members of the 
population (the substitution rate), which 
can be more than an order of magnitude 
lower than the de novo mutation rate, 
but there may be intermediate rates for a 
period of time (5). A further factor that is 
frequently encountered in the relatively 
rapidly changing mitochondrial DNA ge- 
nomes of mammals is saturation. This oc- 
curs when mutation changes one nucleotide 
to another, and then back again. This can 
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Humpback whales (Megaptera novaeangliae) 
are often inaccessible, which makes assessment 
of their mutation rate challenging. 


be anticipated and modeled (6), but there 
are uncertainties involved. Determining 
the mutation rate with trio analyses avoids 
these issues, because time is too short for 
the loss of anything other than a dominant 
lethal mutation. 

Knowledge of the mutation rate can re- 
veal other important aspects of the biol- 
ogy of a species, for example, demography. 
The genetic effective population size (NV,) is 
an idealized approximation of the average 
number of individuals that produce suc- 
cessful progeny per generation, which gov- 
erns the level of noise in evolutionary pro- 
cesses. Sudrez-Menéndez et al. used linkage 
disequilibrium analysis (nonrandom associ- 
ation of loci) and calculated the N, of hump- 
back whales before commercial whaling 
began to be 5,700, though, as the authors 
point out, there are caveats to these estima- 
tions. N, is generally smaller than the total 
number of reproductive adults in the popu- 
lation, owing to variance in reproductive 
success among individuals, demographic 
fluctuations in population size, and delete- 
rious mutations (which can purge variation 
from linked chromosomal regions). The ra- 
tio of N, to the census population size var- 
ies widely (7) and, for the humpback whale, 
may be roughly 0.1 (8). 

All methods for estimating demographic 
trajectories depend critically on accurate 
measures of both the mutation rate and the 
generation time. An alternative method for 
calculating this trajectory is based on co- 
alescence (the time at which evolutionary 
lineages come together) and the knowledge 
that the interval between these coalescence 
points varies with N, [(9), compare with 
(10)]. For example, the skyline model can 
estimate these demographic trajectories 
and incorporate ancient DNA to estimate 
the relevant mutation rate [but see (11, 12)]. 
These rates may be somewhat slower than 
trio-based rates (6) but can provide remark- 
ably close correlation with known histori- 
cal events (13) and are typically faster than 
substitution rates calculated from fossil- 
calibrated phylogenies (J, 5). 

A long-term average N, can be calculated 
from the diversity of the population at pu- 
tatively neutral sites within the genome 
(the behavior of which is not biased by se- 
lection) and the mutation rate. A notable 
negative correlation between taxon-spe- 
cific nuclear mutation-rate estimates and 
their long-term N, is found when organ- 
isms are compared across the entire tree 
of life (14), and now there are enough data 
to see this trend emerging among mam- 


mals. Estimates of mammalian mutation 
rates (per generation) range approximately 
fourfold, from 0.4 x 10-8 (pig) to 1.6 x 10-8 
(gray mouse lemur), with cetaceans having 
relatively high estimated mutation rates, 
in the range of 0.9 x 10° to 1.4 x 10° (J, 
3). The observed negative scaling is consis- 
tent with the drift-barrier hypothesis (15), 
which postulates that as N, declines, drift 
reduces the ability of selection to purge 
mutator alleles. When N, is large, selection 
is more efficient at reducing the mutation 
rate. Other factors that may influence ver- 
tebrate mutation rates include parental 
age, species-level fecundity, a higher muta- 
tion rate in males than females [in birds 
and mammals (3)], and genome size (/4). 
Several uncertainties remain with respect 
to the new data from Suarez-Menéndez et 
al. as well as recent estimates from other 
species of vertebrates (3). Estimates of N, 
that are derived by dividing standing lev- 
els of variation at putatively neutral sites 
by mutation rate are subject to biases, 
with the bias direction depending on the 
relative magnitude of sampling error that 
is associated with estimates of nucleotide 
variation and mutation rates. Additionally, 
most mammalian mutation-rate estimates 
are based on very small numbers of trios 
(often only one), although it is known that 
mutation rates can vary by a factor of at 
least two among individuals within a spe- 
cies (J4). Thus, new estimates such as the 
one presented by Suarez-Menéndez ez al., 
which have been notably lacking for long- 
lived species other than humans, represent 
progress toward a fuller understanding of 
the rate of origin of the raw material of evo- 
lution. At the same time, caution is needed 
when dating past evolutionary events with 
present-day mutation rates and drawing 
inferences about the biological basis for 
taxon-specific mutation-rate differences. 
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HEALTH 


Unanswered questions about 
the causes of obesity 


Obesity is now a global pandemic, but there is little 


consensus about the causes 


By John R. Speakman??34, 
Thorkild |. A. Sorensen®®, 
Kevin D. Hall’, and David B. Allison® 


besity is a major health issue that 

has reached pandemic status with 

no clear solutions. Much has been 

learned over the past 50 years about 

the regulation of body fat. Examples 

include the discovery of the hormone 
leptin, finding thermogenic brown adipose 
tissue in adult humans, elucidating path- 
ways in the brain that affect hunger and 
feeding behavior, quantifying adipocyte 
turnover and the lipids therein, identifying 
single genes that produce rare but severe 
obesity, and finding thousands of genetic 
variants associated with individual differ- 
ences in body mass index (BMI). Despite 
this progress, there remain several key 
questions to be answered to aid the preven- 
tion and treatment of obesity. 

Confusion about the causes of obesity has 
arisen based on the false dichotomy of genes 
versus environment (rather than the com- 
bined effects of genes and environment). 
At any point in time, most of the variance 
in levels of obesity among individuals may 
be genetic. But, changes across time are 
predominantly driven by the environment. 
Which individuals deposit the most fat in 
response to environmental change is influ- 
enced by both. 

Adult human body weight is often stable 
over long periods within a given environ- 
ment. Moreover, individuals often respond 
to imposed perturbations of energy balance 
by altering components of energy expen- 
diture and intake to resist weight change. 
Frequently, body weight (and fatness) re- 
turns to a similar level to what it was before 
the perturbation after it ends. This might 
suggest that body weight is regulated around 
an individual set point (J). But if that is the 
case, why is there an obesity pandemic? Two 
alternatives to a strict set point model in- 
clude that the so-called regulation is an illu- 


sion because when individuals gain weight, 
they also increase expenditure (2), which 
brings them to a new equilibrium (the “dy- 
namic equilibrium point”). Or, there are two 
“control points” that stop humans from get- 
ting too fat or too thin, but between those 
limits there is very little control (the “dual 
intervention point model”) (J). However, 
these alternative models are inadequate de- 
scriptors of the whole obesity phenomenon. 

To develop more-comprehensive models, 
it is necessary to understand the molecu- 
lar mechanisms of intake and expenditure 
control. For example, during periods of en- 
ergy deficit, hunger is increased, and energy 
expenditure is suppressed. Leptin appears 
to play a major role in these responses but 
does not fully explain them. Little is known 
about the mechanisms behind responses to 
overfeeding. Rodent studies suggest that 
activation of thermogenic brown adipose 
tissue or futile energy cycling elsewhere are 
potentially involved, but their contributions 
in humans remain unclear. A system that 
directly senses overall weight (a gravitostat) 
might be involved (3), but the molecular ba- 
sis is ill-defined. 

The sizes of individual organs and tis- 
sues also seem to be regulated. People with 
obesity, almost without exception, also 
have increased lean body mass (all non- 
adipose parts of the body, including skel- 
etal muscles, bones, and viscera) (4). The 
mechanisms involved in partitioning en- 
ergy imbalances between lean versus adi- 
pose tissues have not yet been elucidated. 
The mass of adipose tissue depends on the 
balance between adipogenesis and apopto- 
sis and between lipogenesis and lipolysis 
in adipocytes, but the mechanisms con- 
trolling this are incompletely understood. 
Moreover, the factors that influence the 
relative deposition of lipids into adipose 
versus nonadipose tissues are particularly 
important to understand because ectopic 
lipid deposition seems to be implicated in 
metabolic disease. Elucidating these con- 


nt. 
trol mechanisms has the potential to re ied 
therapeutic targets. ; 5 

There appears to be a balance between 
the nutrient and energy needs of the body 
and food intake. However, little is known 
about how these needs are sensed and how 
the corresponding signals influence appe- 
tite and behavior. For example, although 
short-term bouts of exercise may not reli- 
ably influence food intake (5), prolonged 
exercise interventions and high activity 
stimulate food consumption (6). Although 
leptin acts as a long-term circulating signal 
of overall body energy stores and stimu- 
lates hunger when they are shrinking, other 
currently unknown mechanisms appear to 
link expenditure to intake through rest- 
ing metabolic rate and fat-free mass (7). 
Discovery of these signals will likely be im- 
portant targets for modulating fat storage. . 
However, it is important to recognize that 
the different routes of energy expenditure 
(physical activity, rest, and thermoregula- 
tion) may not be related to each other in an 
independent additive fashion. Elevations of 
one aspect of expenditure may cause com- 
pensatory decreases in other components 
and/or changes in intake. The long- and 
short-term consequences of manipulating 
different components of energy balance re- 
main unclear. 

Food intake and energy expenditure are 
controlled by the brain. Yet, it is unclear 
how the brain orchestrates eating behavior 
in response to signals from the periphery 
and the environment. The brain integrates 
signals from the body regarding nutrient 
and energy needs to modulate food choice 
and the amount consumed. The hindbrain 
and hypothalamus regulate food intake in 
response to signals from the gut and circu- 
lating hormones. The recent success of glu- 
cagon-like peptide 1 receptor agonists for 
the treatment of obesity is because they en- 
gage these brain regions at pharmacological 
doses. There are probably many other brain 
regions involved. For example, there is less 
knowledge about the roles of brain regions 
that support motivation, reward, habit for- 
mation, cognitive abilities, and emotional 
control and how these integrate mutually 
and with environmental cues to generate 
the total feeding response. 

It is presently unclear what social and 
physical factors in the environment are 
driving increased obesity prevalence. 
Physical changes in the food environment 
have likely been important contributors to 
increasing obesity prevalence, but the most 
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Some aspects of the changing food environment, combined with other environmental factors and genetic predisposition, have created the current obesity pandemic. 
However, the details of this interaction remain elusive, leading to many unanswered questions about what might appear to be a simple problem of positive energy balance. 


important factors in food that induce obe- 
sity remain unknown. Intake is not only for 
energy but for specific nutrients as well. 
How nutrients are sensed and the signals 
that control nutrient-specific appetites are 
poorly understood. The protein leverage 
model (8) suggests that protein content has 
been diluted by excess carbohydrate and 
fat, thereby promoting excess energy intake 
to meet the body’s protein needs. However, 
the determinants of protein requirements, 
the signals of protein deficiency and sur- 
plus, and the resultant mechanisms con- 
trolling total protein intake are unclear. 
Alternatively, the carbohydrate-insulin 
model (9) suggests that the increase in high 
glycemic carbohydrates in food results in 
increased insulin levels and thereby drives 
adipose tissue lipid accumulation and con- 
sequently increased appetite and decreased 
energy expenditure. But, the mechanisms 
remain to be elucidated. Fructose does not 
stimulate insulin but may elicit different 
mechanisms that lead to fat storage. 
Energy density of food, determined 
partly by water and fat content, has a pro- 
found effect on energy intake, at least over 
the short term (10). Laboratory rodents and 
humans consuming high-fat diets eat less 
weight of food but overconsume energy. 
This leads to some intriguing questions 
about why the intake of high-energy den- 
sity foods is so difficult to regulate and to 
what extent increased availability of high- 
energy density foods has contributed to 
the pandemic. Conversely, a reduction in 
low-energy density alternatives may be a 
driver. For instance, dietary fiber has low 
energy density, and its intake has declined 
over the past 30 years in the USA. Are there 
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mechanisms by which fiber exerts effects 
on satiety and thereby limits energy intake 
other than by lowering food energy density? 
The availability and marketing of ultrapro- 
cessed foods have substantially increased 
over several decades, and diets high in such 
foods promote greater energy consumption 
(11). It is not yet known what attributes of 
ultraprocessed food result in excess energy 
intake. Certain nutrient combinations may 
be hyperpalatable and influence the activ- 
ity of brain regions involved in reward and 
motivation, thereby promoting their over- 
consumption, but the specific mechanisms 
remain unclear. Whether noncaloric sweet- 
eners and the thousands of food additives 
common in modern foods affect satiety and 
energy intake is also unclear. 

Social factors may be a feature of the 
obesogenic environment that differs among 
and within societies, and the distribu- 
tion and impact of these factors may have 
changed over time. A strong environmen- 
tal correlate of obesity is social adversity, 
including poverty, particularly in high- 
income countries (12). However, in low- and 
middle-income countries, poverty tends to 
be associated with lower BMI. Perhaps, a 
combination of food availability and food 
insecurity explains this pattern, so that in- 
dividuals who experience food insecurity 
despite food availability become obese. But 
what triggers the perception of food insecu- 
rity, and what the physiological reaction to 
it is, including increased fat storage, remain 
unknown. Chronic stress may be an impor- 
tant factor. Moreover, these relationships 
appear more prominent among women 
than men and may contribute to obesity 
occurring more frequently in women than 


in men. Such processes may be reinforced 
owing to the stigmatization and discrimina- 
tion that people with obesity suffer, creating 
a vicious cycle. 

The environment and societal changes 
may also have affected physical activity. 
Occupational physical activity in the USA 
has declined since the 1950s (13), but leisure- 
time activity has increased. Energy expendi- 
ture resulting from physical activity has not 
declined since the late 1980s in the USA and 
Europe (/4). Moreover, low energy expendi- 
ture from physical activity does not predis- 
pose individuals to obesity. Although people 
with obesity are less active, they do not 
spend less total energy on activity. Although 
it is often asserted that increasing sedentary 
behavior is a major cause of the obesity pan- 
demic, this is far from clear, and present evi- 
dence does not support this conclusion. 

Environmental drivers of the obesity 
pandemic have not resulted in uniform 
increases in adiposity within and between 


populations, potentially also because of . 


genetic differences between individuals. 
Individual variation in responses to many 
of the physiological and environmental fac- 
tors contributes to variation in obesity sus- 
ceptibility. Much is known about the rare, 
extreme monogenic forms of obesity, but 
there are thousands of single-nucleotide 
variants (SNVs) associated with BMI (J5). 
However, knowledge about the mechanisms 
by which these SNVs become associated 
with BMI is limited, both because of the 
inadequacy of BMI as a measure of obe- 
sity and because of uncertainty about the 
causal mechanisms linking genetic variants 
to phenotypes. Moreover, at the fetal stage 
and early in life, individuals are wholly de- 
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pendent on their mothers for nourishment. 
Studies of rats and mice show that mater- 
nal obesity and diet influence offspring sus- 
ceptibility to obesity, possibly by epigenetic 
mechanisms (75), but whether this also ap- 
plies to humans is uncertain. 

Given the diversity of environmental and 
genetic causes of obesity, there are many 
routes to excess adiposity. There may be 
subtypes of obesity, which could guide spe- 
cific prevention and treatment. At present, 
however, there are many uncertainties, 
such as how the different subtypes might be 
defined. Individual variation in food pref- 
erence is enormous, but the basis of this 
variation and its consequences for obesity 
development remain unknown. Whether 
different food choices lead to adaptations 
of the gut and the resident microbiota to 
alter energy absorption efficiency is also 
unclear. In some laboratory rodent models, 
differences in absorption efficiency can be 
sufficient to drive large differences in fat 
storage, but whether the same applies to 
humans has not been sufficiently studied. 
Gut microbiota differences may be associ- 
ated with absorption efficiency and may 
have effects on other aspects of energy 
balance. It remains unclear whether dif- 
ferences in the microbiota between people 
with and without obesity are consistent and 
reproducible and whether these differences 
are consequences or causes of obesity. 

By building on the considerable advances 
made in the past 50 years, the study of the 
causes of obesity promises to be a rich area 
for discovery that may be transformative for 
the lives of millions of people. Improving 
our understanding of environmental driv- 
ers and how these interact with genetic 
composition is vital to making future in- 
roads to this serious medical condition. & 
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Arctic sea ice, ocean, 
and climate evolution 


Wind variability affects the rate of Arctic sea ice decline 


By Sheldon Bacon 


rctic sea ice decline is caused by global 
warming and is accelerated by regional 
feedbacks resulting from changes in 
absorption and reflection of heat by 
ice and snow (albedo), among other 
processes. By the end of the century, 
Arctic sea ice is confidently expected to dis- 
appear in summertime, although this will 
probably happen sooner (1). The Arctic sea 
ice area decline further increases global 
warming rates because the albedo reduction 
means that more heat is absorbed by Earth. 
The Arctic sea ice changes by themselves are 
concerning, but remote atmospheric and oce- 
anic linkages also force changes in weather, 
climate, and extreme events at lower lati- 
tudes (2). On page 972 of this issue, Polyakov 
et al. (3) show that mid-term variability in the 


regional atmospheric circulation can control 
decadal changes in the rate of sea ice decline. 

The Arctic Ocean is stratified, or lay- 
ered, nearly horizontally in density. Sea 
ice floats on the cold, fresh, ~200-m-thick 
halocline. The halocline in turn overlies 
the saline and relatively warm Atlantic 
water layer, which is ~1 km thick. Below 
are deep, dense, and very slow-moving 
waters. Summertime heating and melting 
generate a thin mixed layer in the surface 
that largely disappears with winter freez- 
ing. Thus, the halocline largely insulates 
the underlying ocean from surface fluxes 
of heat and momentum. As a result, across 
most of the Arctic Ocean, circulation is 
sluggish, turbulence and mixing are largely 
quiescent, and sea ice loss is mainly caused 
by atmospheric forcing—wind and heat 
acting on the sea surface. 


The Arctic Ocean in the past and future 

In the past, sea ice almost entirely covered the Arctic Ocean, and the ocean displayed slow 
currents (left). In the future, sea ice is predicted to disappear, with currents’ speed increasing 
and causing turbulence to intensify, particularly over the continental shelf edge (right). 

This could cause upward heat flux to the surface that might further reduce sea ice, even in winter. 
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Differences to this relatively simple pic- 
ture of the Arctic Ocean used to exist in the 
western Eurasian Basin (north and east of 
Svalbard, north of Norway) where Atlantic 
water entering the Arctic through the Fram 
Strait (between Svalbard and _ northeast 
Greenland), at or near the surface, remained 
in contact with the atmosphere. In a previ- 
ous study (4), the Atlantic character was seen 
to be extending eastward, toward eastern 
Siberia, between the early 2000s and the 
mid-2010s, leading to the notion of Arctic 
Ocean “Atlantification.” It was shown that sea 
ice reductions, weakening of the halocline, 
and reduction in the depth of the Atlantic 
water layer were together making the eastern 
Eurasian Basin, north of Siberia, resemble the 
western basin in terms of its Atlantification. 
The consequent increased ocean heat flux 
was reducing the winter sea-ice formation 
rate, thus explaining the then-recent reduc- 
tion in Arctic sea ice cover in the eastern ba- 
sin. The hypothesis was that Atlantification 
was moving part of the Arctic Ocean toward 
a new climate state. 

Polyakov et al. build on these observed 
changes in atmospheric circulation and in ice 
and ocean behavior. After decades of Arctic 
sea ice decline, they show that ice area and 
thickness have been stable, although variable 
since 2007. The decadal-scale variability of 
the atmospheric circulation explains this sea 
ice behavior. 

Atmospheric circulation variability is de- 
scribed by using normal modes: patterns of 
variability in which all parts of the system 
change together. In the Arctic, the leading 
mode, the Arctic Oscillation, represents the 
variability of the polar high-pressure system. 
The second mode, the Arctic Dipole, is as- 
sociated with anticyclonic winds over North 
America and cyclonic winds over Eurasia. 
The Arctic Dipole was roughly neutral before 
2007 and increasingly positive thereafter. In 
its positive phase, the Arctic Dipole weakens 
inflows to the Arctic Ocean across the Fram 
Strait while strengthening inflows through 
the Barents Sea, northeast of northern 
Norway. The resulting stronger Arctic Ocean 
circulation has been responsible for increased 
freshwater accumulation in the Amerasian 
Basin, which in turn has increased stratifica- 
tion and reduced upward oceanic heat fluxes, 
slowing Arctic sea ice loss. This atmospheric 
forcing is consistent with observed and mod- 
eled variability of Arctic Ocean freshwater 
accumulation and export (5, 6). An eventual 
return to dominance of the Arctic Oscillation 
mode would reinstate the faster sea ice re- 
duction. Regardless of the mode in place, 
both the Arctic atmosphere and the Arctic 
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Ocean are predicted to continue to warm 
faster than the rest of the planet through the 
21st century (1,7). 

There remain unexplored and plausible 
ocean mechanisms that may further acceler- 
ate Arctic warming, with consequences for 
the Arctic ice and ocean system, and with 
possible atmospheric feedbacks. When the 
Arctic Ocean is completely covered by sea 
ice, stresses exerted by winds are largely ab- 
sorbed by the ice and ultimately transmit- 
ted to land, greatly reducing the mechanical 
forcing of the ocean, hence the slow circu- 
lation. When the sea ice declines, more of 
the ocean will be directly exposed to wind, 
increasing the efficiency of atmosphere-to- 
ocean momentum transfer and accelerating 
the ocean circulation, called ocean “spin-up.” 
Consequently, parts of the ocean, particularly 
where it flows over rough topography around 
the edge of the continental shelf, would be- 
come more turbulent, increasing the upward 
heat flux out of the Atlantic water layer (8) 
and likely spinning off more eddies (circu- 
lar currents), carrying that heat toward the 
North Pole (see the figure). The strength of 
the Coriolis force—the force on a fluid that 
results from the rotation of Earth—in high 
latitudes changes the balance of physical pro- 
cesses. Exotic turbulence mechanisms such 
as unsteady lee (standing) waves can be ex- 
pected to dominate (9, 10), and they would be 
less season-dependent, further threatening 
Arctic sea ice survival outside summertime. 
Such mechanisms are not represented in any 
current forced ocean or coupled climate mod- 
els and are the subject of ongoing research. 

In the Arctic Ocean, Atlantic-layer heat is 
presently trapped below the halocline, except 
where Atlantification has been taking place. 
However, ocean spin-up will increase turbu- 
lence and mixing, which might release this 
heat more widely and in turn may accelerate 
the year-round sea ice decline. This would 
bring further and unknown consequences 
for both Arctic and mid-latitude weather, cli- 
mate, and extreme events. 
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ANTHROPOLOGY 


Did our 
ancestors 
nearly die out? 


Genetic analyses suggest an 
ancient human population 
crash 900,000 years ago 


By Nick Ashton! and Chris Stringer? 


arth’s climate system began to change 

during the Middle Pleistocene transi- 

tion, which is associated with a severe 

cooling phase about 900,000 years ago. 

How this change might have affected 

human populations is difficult to deter- 
mine, because the human fossil and archaeo- 
logical records are relatively sparse for this 
period and lie beyond the reach of ancient 
DNA recovery. On page 979 of this issue, Hu 
et al. (1) use a new method of analysis called 
FitCoal to project current human genetic 
variation backward in time, to estimate the 
size of populations at specific points in the 
past. The results suggest that our ancestors 
suffered a severe population bottleneck that 
started around 930,000 years ago and lasted 
for almost 120,000 years. This is estimated 
to have reduced the number of breeding in- 
dividuals to ~1300, bringing our ancestors 
close to extinction. 

Hu e¢ al. argue that the proposed bottle- 
neck correlates with a chronological gap in 
the African and Eurasian fossil records and 
may have led to the evolution of a new hu- 
man species, ancestral to Homo sapiens. They 
favor a widely defined H. heidelbergensis as 
this ancestral species, probably emerging in 
Africa by 800,000 years ago. Two lineages of 
large-brained humans have long been recog- 
nized in the later Pleistocene: H. sapiens and 
Neanderthals (H. neanderthalensis). A third 
group—Denisovans—was identified more re- 
cently from ancient DNA in fossils and sedi- 
ments at Denisova Cave in Siberian Russia 
(2). Considering how the inferred bottleneck 
might have affected human evolution in- 
evitably leads to debate about the nature of 
the last common ancestor of H. sapiens, Ne- 
anderthals, and Denisovans, and when and 
where this ancestor lived. 

It is not yet clear whether the last common 
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ancestor lived in Europe, Asia, or Africa. Hu 
et al. attributed the East Asian fossil record 
from that time to the more ancient human 
species H. erectus. However, there is evidence 
from sites such as Yunyang, China, of a dis- 
tinct species that is morphologically closer to 
later humans from Eurasia and Africa such 
as H. sapiens and Neanderthals (3), suggest- 
ing their ancestral lineage(s) might already 
have diverged from H. erectus by the time 
of the Yunyang fossils, which are dated at 
800,000 to 1,100,000 years ago. Moreover, 
some genetic models for the deep ancestry 
of H. sapiens and Neanderthals suggest that 
concepts of a single last common ancestor in 
time and space might be illusory (2, 4). 
Estimates using genomic data from H. 
sapiens, Neanderthals, and Denisovans 
calibrate the last common ances- 
tor to between about 500,000 
and 700,000 years ago (2), which 
would relate the inferred bottle- 
neck to this ancestral population, 
wherever it lived, with compa- 
rable signals expected from Den- 
isovan and Neanderthal genomes 
as well as from those of H. sapi- 
ens. However, some recent stud- 
ies of dental and cranial variation 
in fossilized skulls place the last 
common ancestor earlier, between 
about 800,000 and 1,200,000 years 
ago (5, 6), which might mean that 
already separated basal lineages 


cupation in Africa, Asia, and Europe that 
have been attributed to this period of re- 
versed polarity and, with varying degrees of 
certainty, to the inferred time of the bottle- 
neck. These include sites in Kenya such as 
Kilombe (GqJh1 and Gqjh2), Kariandusi 
and Isinya (8), and locations in Bed IV of 
Oldupai in Tanzania (9). Of particular im- 
portance are the human fossils from Gom- 
bore II in Ethiopia that have an estimated 
age of 850,000 years before present (JO), 
which lies within the bottleneck window. 
Sites in Europe, such as Gran Dolina and 
Boella in Spain, Monte Poggiolo in Italy, and 
Happisburgh 3 in the UK, have also been 
attributed to this period (17). Of further im- 
portance are the increasing number of sites 
in China, particularly in the Qinling Moun- 


Archaeological evidence during the bottleneck 
Evidence of human occupation, including fossilized human bones (sites in 
bold), dated to the proposed population bottleneck [813,000 to 930,000 
years before present (kyr B.P.)], is recorded in Asia, Europe, and Africa. 

Gray bars represent the range of age estimates for when site occupation is 
thought to have occurred, and arrows indicate that occupation is thought 

to have extended beyond the date range shown [based on (3, 8-14)]. This 
suggests that the effects of the bottleneck were limited geographically and 
chronologically. The stone tools from Trinil are disputed. 
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tain Range, which cover the time span of 
the proposed bottleneck (12). This includes 
the human fossils from Yunyang (3). 

These fossil records dating to the inferred 
bottleneck period 813,000 to 930,000 years 
ago suggest that humans were widespread 
inside and outside of Africa. Therefore, what- 
ever caused the proposed bottleneck may 
have been limited in its effects on human 
populations outside the H. sapiens lineage, 
or its effects were short-lived. This also im- 
plies that the cause of the bottleneck was 
unlikely to have been a major environmental 
event, such as severe global cooling, because 
this should have had a wide-ranging impact. 
Nevertheless, the provocative study of Hu et 
al. brings the vulnerability of early human 
populations into focus, with the implication 
that our evolutionary lineage was 
nearly eradicated. Recent work , 
suggests that Europe was prob- 
ably completely depopulated after 
a previously unrecognized cold 
phase about 1,100,000 years ago, 
before the proposed bottleneck (77). 
If methods such as FitCoal can be 
further applied to growing data : 
from the genomes of H. sapiens, 
Neanderthals, Denisovans, and 
hopefully others, this should clarify 
ancient bottlenecks, and which re- * 
gions under habitation might have 
been the most or least affected. 
However, the genomic data must 


of Neanderthals and Denisovans Hiimantossis found... population Chron also be tested against the fossil and 
avoided the bottleneck, or suffered at bolded sites | ' bottleneck archaeological records of early hu- 
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the figure). Despite issues with 
the resolution of dating, there are 
sites with evidence of human oc- 


la Boella 
Gran Dolina 
Happisburgh3 = oF 


—————Sa Europe 


10.1126.science.adj9484 


948 1SEPTEMBER 2023 « VOL 381 ISSUE 6661 science.org SCIENCE 


POLICY FORUM 


CLIMATE AND ECOLOGY 


Unlock the Endangered Species 
Act to address GHG emissions 


For the first time, ESA evaluations can include impacts on 
polar bears from greenhouse gas emissions 


By Steven C. Amstrup' and Cecilia M. Bitz? 


n 2008, projections that up to two-thirds 

of the world’s polar bears could disap- 

pear by mid-century (J) led to polar bears 

becoming the first species listed under 

the US Endangered Species Act (ESA) 

because of threats from anthropogenic 
climate warming. Updated analyses (2) cor- 
roborated the 2008 projections and showed a 
linear but inverse relationship between Arctic 
sea ice extent and global mean temperature. 
Despite the relationship between warming 
and sea ice loss, absence of a quantitative 
link between anthropogenic greenhouse gas 
(GHG) emissions, sea ice loss, and declin- 
ing polar bear vital rates has foiled full ESA 
implementation for polar bears. By quantify- 
ing the relationship between anthropogenic 
GHG emissions and polar bear recruitment, 
we show that sensitivities to cumulative 
anthropogenic emissions explain observed 
population trends, allow estimation of demo- 
graphic impacts from new emissions sources, 
and enable ESA procedures to assess global 
warming impacts of proposed actions—along 
with impacts on the ground. 

Section 7 of the ESA provides a process by 
which federal agencies ensure that actions 
they take, including those they fund or au- 
thorize through leasing activities or issuance 
of permits (e.g., oil and gas production), do 
not jeopardize the continued existence of 
listed species [see supplementary materials 
(SM)]. But in October 2008, then-Solicitor of 
the Department of Interior David Bernhardt 
issued a memorandum [M-Opinion M-37017 
(see SM)] stating that Section 7 consultations 
would not be required “unless it is estab- 
lished that emissions from a proposed action 
would cause an indirect effect to listed spe- 
cies or critical habitat.’ Addressing his “un- 
less” requirement, Solicitor Bernhardt con- 
cluded: “Any observed climate change effect 
on a member of a particular listed species 
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or its critical habitat...would be the conse- 
quence of the collective GHG accumulation 
from natural sources and the worldwide an- 
thropogenically produced GHG emissions 
since at least the beginning of the industrial 
revolution” and therefore “cannot be attrib- 
uted to the emissions from any particular 
source.” This inability to attribute negative 
consequences for polar bears to emissions 
from anthropogenic GHG sources has pre- 
vented the ESA from considering anthropo- 
genic GHG emissions when evaluating im- 
pacts on polar bears from proposed actions. 
This limitation eviscerated the ESA with re- 
gard to the global warming threat that justi- 
fied the 2008 listing of polar bears. 


POLAR BEARS, SEA ICE, EMISSIONS 

Polar bears occur in 19 somewhat distinct 
subpopulations within four major eco- 
regions (see SM). Throughout their range, 
they rely on sea ice over productive conti- 
nental shelf waters to catch their prey (J-3). 
When sea ice concentration falls to levels 
preventing effective foraging, polar bears 
are forced onto land or onto ice that has 
retreated far from shore and over deep un- 
productive waters. In both situations, they 
are largely food deprived (4, 5). During these 
forced fasting durations (FDs), polar bears 
survive on accumulated fat reserves and lose 
nearly a kilogram of body mass each day (6). 
Anthropogenic climate warming shortens 
the period during which polar bears can for- 
age from sea ice to build up their fat reserves 
and lengthens their fasting periods. 

The fundamental dependence of polar 
bears on sea ice (1-6) and the documented re- 
lationship between declining sea ice and cu- 
mulative anthropogenic carbon dioxide (CO,) 
emissions (7) assures that polar bear distri- 
bution and abundance ultimately can only 
decline as cumulative emissions increase. 
Yet, a quantifiable link between polar bear 
demographics and sea ice attributes such 
as extent, area, thickness, and volume has 
previously evaded discovery. In the absence 
of such a quantifiable link, previous polar 
bear studies have depended on associations 


between modeled future sea ice changes 


estimates, or assumptions based on the gen- 
eral relationship between demographic per- 
formance and sea ice availability (8, 9). 

By contrast, Molnar et al. (10) recognized 
that demographic performance is determined 
by energy reserves of bears at fast initiation, 
their energy expenditures while fasting, and 
fast duration. The polar bear’s physiological 
constraints, unlike demographic estimates 
derived from current population assess- 
ments, will remain unchanged, even as sea 
ice is changing, and provide a more robust 
foundation for assessment of future impacts. 
Molnar et al. (10) established two critical 
features of FD: (i) fasting impact thresh- 
olds—FDs beyond which the percentage of 
recruitment and survival failure increase rap- 
idly—and (ii) the demographic “sensitivity” 
when FD exceeds the fasting impact thresh- 
olds (determined by the regression intercept 
and slope of increasing recruitment and sur- 
vival failure—versus FD; see fig. S2a). Molnar 
et al. (10) defined FD as 24 days shorter than 
the number of ice-free days (IFD). Because 
IFD determines FD, IFD is the critical sea ice 
attribute providing the previously missing 
quantifiable link between observed sea ice 
decline and the polar bear’s ability to main- 
tain physiological and reproductive health. 

Molnar et al. (10) showed that survival of 
cubs (recruitment into the next generation), 
determined by the mother’s declining abil- 
ity to provide enough milk for a cub’s early 
survival, is the first demographic threshold 
crossed as FD increases (see SM). Because 
populations that are not successfully recruit- 
ing young can only decline, and because the 
recruitment impact threshold is the most 
sensitive to FD, we focused on the sensitivity 
of cub recruitment to emissions (see SM). 

Armed with new knowledge of FD impacts 
on polar bear recruitment (J0), here we link 
FD to cumulative anthropogenic GHG emis- 
sions. This reveals a causal connection be- 
tween emissions from proposed actions and 


6 
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present demographic or population viabisey— 


impacts on polar bears and helps explain re- . 


cent population trends. Further, by quantify- 
ing polar bear impacts from anthropogenic 
GHG emissions rather than atmospheric con- 
centrations, we disproved claims in M-37017 
that consequences from current anthropo- 
genic GHG emissions cannot be separated 
from natural sources or past accumulation of 
anthropogenic emissions. 

Because climate warming and its associ- 
ated sea ice loss result from all anthropogenic 
GHG emissions, not just CO,, we considered 
CO, equivalent (CO,-eq) emissions—the com- 
bination of GHG pollutants that contribute to 
climate warming—adjusted for their global 
warming potential. Considering CO,-eq also 
is important because emissions and impacts 
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of some GHGs can be more quickly and easily 
mitigated than CO,,. 

We set FD to 24 days less than the number 
of IFD in the Seasonal Ice Ecoregion and equal 
to IFD elsewhere (see SM). As in Molnar et 
al. (10), we defined crossing the recruitment 
impact threshold as the first time FD exceeds 
117 days for at least three of five consecutive 
years (10). This decision rule recognizes that 
FD is increasing on average with increasing 
cumulative emissions, that bears cannot re- 
cover from clusters of years above the impact 
threshold, and that future FD will increase on 
average as anthropogenic emissions increase. 
Detailed methods, including uncertainty esti- 
mates, are described in the SM. 


Sensitivity of FD and recruitment failure to cumulative CO,-eq emissions 


Sensitivity to cumulative CO,-eq emissions of FD (-y,) and recruitment failure (B,y,) for all polar bear subpopulations that have 


sea ice showed the most rapid change with 
increasing emissions (steepest FD/CO,-eq 
slopes). In the Chukchi Sea, for example, FD 
increased from ~12 days when satellite imag- 
ery of sea ice first became available in 1979 
(see SM) to ~137 days in 2020 (table S2), and 
another day of fasting was added for each 14 
(0, 18) Gt CO,-eq (hereafter 95% confidence 
intervals appear in parentheses after cen- 
tral estimates) released into the atmosphere 
(see the table). Contrary to what might have 
been expected, the Seasonal Ice Ecoregion, 
where sea ice always has melted entirely dur- 
ing summer, saw the slowest rate of FD in- 
crease. The most gradual slope was observed 
in southern Hudson Bay, where FD increased 


experienced at least 10 years with ice-free seasons from 1979 to 2020. 


The sensitivity (regression slope) of recruit- 
ment to cumulative emissions ranges from 
0.11 (0.084, 0.15) % Gt" in the Chukchi Sea 
to 0.016 (0.0, 035) % Gt" in southern Hudson 
Bay (see the table). These sensitivities, when 
multiplied by the emissions accumulated 
since each subpopulation region crossed the 
117-day recruitment impact threshold, sug- 
gest recruitment impacts that are consistent 
with independent estimates of population 
status and trend. 

For example, in western Hudson Bay 
(WHB), the best-known of all polar bear sub- 
populations, the 117-day recruitment impact 
threshold was reached in 1994 (table S1). The 
1193 Gt of CO,-eq emitted from 1994 to 2020 
and the recruitment failure 
rate of 0.025 (0.0072, 0.046) 
% Gt" emissions (see the ta- 
ble) would mean an ~30 (9, 
55) % decline in recruitment 
since crossing the thresh- 


OCCUPIED AREA FD/CO,-eq (d Gt”) CO,-eq/FD AREA WEIGHTED FD/CO,-eq RECRUITMENT FAILURE/ old. This percentage decline 

SUBPOPULATION (km2) A %; (Gtd3)1/y, (a km?Gt*) Ay, CO,-eq (% Gt) By, would mean that the annual 
South Beaufort Sea 135,000 0.043 (0.024, 0.065) 23°15, 41) 5800 (3300, 8800) 0.065 (0.087, 0.099) cub survival rate in WHB, 
Chukchi Sea 1,280,000 0.073 (0.056,0.096) 14(10,18) 93,000(71,000,120.000) 0.11 (0.084, 0.15) which was estimated as 
~70% during the 1980s (11), 

Laptev Sea 1,610,000 0.048 (0.036, 0.065) 21(16,28) 77,000 (57,000, 100,000) 0.073 (0.054, 0.098) erauld nae he 400% - Such 
Kara Sea 980,000 0.067 (0.048, 0.091) 15 (11,21) 65,000 (47000, 89,000) 0.010 (0.073, 0.14) a decline in annual cub sur- 
Barents Sea 786,000 0.064 (0.035, 0.098) 16(10,29) _51,000(27000, 77000) 0.098 (0.052, 0.15) vival is consistent with the 
low proportion of yearlings 

East Greenland 617000 0.023 (0.009, 0.098) 43(25,113) — 14,000 (5,400, 24,000) 0.035 (0.013, 0.060) slseryed Waceuseamand 
Kane Basin 48,200 0.029 (0.016,0.044) 35(23,62) 1,400 (770, 2100) 0.043 (0.024, 0.067) population estimates sug- 
Lancaster Sound 229,000 0.12 (0.002,0.025)  84(39,360) 2,700 (50,5700) 0.018 (0.0035, 0.038) gesting an ~30% decline in 
numbers between 1987 and 

Baffin Bay 616,000 0.027 (0.016,0.040) 38(25,63) 16,000 (9,800, 25,000) 0.040 (0.024, 0.061) 9016 (12). The most recent 
McClintock Channel 133,000 0.018 (0.006, 0.033) 54(30,159) — 2,500 (830, 4400) 0.028 (0.0094, 0.051) (2021) estimate, however, 
Gulf of Boothia 61,000 0.037 (0.023,0.052) 27(19,43) 2,300 (1,400, 3,200) 0.056 (0.035, 0.079) suggests that the WHB pop- 
= ulation may have declined 

Foxe Basin 502,000 0.023 (0.011,0.037) 44(27,88) 12,000 (5700, 19,000) 0.034 (0.017, 0.056) by early half since the late 
West HudsonBay 176,000 0.017 (0.005,0.031)  60(32,199) 2,900 (840, 5400) 0.025 (0.0072, 0.046) 1980s (J). The recruitment 
South HudsonBay 399,000 0.010 (0.000, 0.023) 93(44,inf) 4,100 (0,9100) 0.016 (0.00, 0.035) sensitivity to emissions in 
- - WHB, therefore, may be 
Davis Strait 554,000 0.024 (0.009, 0.042) 44(24,115) 13,000 (4,800, 23,000) 0.036 (0.013, 0.064) dasarto the upper end of 
Total 8,120,000 366,000 (319,000, 420,000) our confidence interval than 
to the middle, meaning a 

OUTCOMES from ~139 days to ~157 days since 1979, and | current annual cub survival rate of ~32%. 


Emissions and fasting duration 

Molnar et al. (10) found an essentially linear 
relationship between FD and recruitment 
impact (J0). We found that the relationship 
between FD and cumulative CO,-eq emis- 
sions, for subdomains where polar bears oc- 
cur, is also largely linear—by estimating the 
regression relationship between FD and each 
gigatonne (Gt) of cumulative emissions (see 
the figure, table, figs. Sl and $2, and SM). For 
ease of interpreting impact on polar bears, we 
took the reciprocals of regression line slopes 
to reveal the emitted amount of CO,-eq that 
prolongs FD by one additional day (see the ta- 
ble). Emission levels that add 1 day to FD var- 
ied greatly among polar bear subpopulations. 
FD in areas historically covered by perennial 
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93 Gt CO,-eq were emitted for each day added 
to FD (see the figure, table $2, and SM). 


Emissions and demographic impact 

The relationship between FD and cumulative 
CO,-eq emissions can be combined with the 
relationship between declining recruitment 
and FD [from Molnar e¢ al. (10)] to calculate 
the rate at which polar bear recruitment has 
declined with cumulative emissions. The re- 
lationship between declining recruitment 
(increasing percentage recruitment failure) 
and cumulative emissions not only explains 
observed recruitment trends but also allows 
estimating the recruitment response to emis- 
sions from proposed future actions (see SM 
methods and fig. $2). 


Alternatively, a larger numerical decline may . 


mean that WHB crossed the recruitment im- 
pact threshold earlier. Molnar et al. (0) esti- 
mated that the effect of not including the 24- 
day adjustment to the onset of fasting would 
mean threshold crossings 10 to 30 years 
earlier than with the adjustment. Indeed, if 
WHB bears begin to fast as soon as sea ice 
extent drops below 30%, they would have 
exceeded the 117-day recruitment impact 
threshold before 1979, suggesting a 44 (13, 
81) % decline in recruitment and a current 
annual cub survival rate of 40%. Crossing 
the recruitment impact threshold earlier is 
consistent with reduced body condition in 
females, disappearance of early weaning of 
cubs, and reduced cub recruitment reported 
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Linking CO,-eq to polar bear fasting and recruitment failure 

Shown are annual observations (dots) and regressions (solid lines) by subpopulation. Dashed lines are mean 95% 
confidence intervals (Cls) for sampling plus observational uncertainty. Shading illustrates sampling uncertainty 
only for comparison. See table for regression slopes and 95% Cls, and supplementary materials for details. 
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before 1994 (73). With or without the 24-day 
offset in FD, the recruitment declines that we 
estimate corroborate recent observations of 
poor cub survival and major population de- 
cline in WHB. See also the table and figure, 
fig. S3, and SM. 

As when evaluating surface disturbances 
or toxic chemical releases, impacts of GHGs 
emitted from individual actions may appear 
small. This emphasizes the importance of as- 
sessing cumulative impacts at the program- 
matic rather than individual action level. For 
example, emissions from each of the many 
new actions anticipated to occur on US pub- 
lic lands between 2020 and 2050 may appear 
to have minimal impact on polar bears in iso- 
lation. Cumulatively, however, their projected 
24.1 Gt of CO,-eq emissions (see SM) will 
decrease polar bear cub recruitment by 0.6 
(0.17, 1.1) % in WHB and by 2.7 (2.0, 3.6) % in 
the Chukchi Sea. Similarly, although each of 
the hundreds of power plants in the US make 
a relatively small contribution to emissions, 
together their nearly 2 Gt annual CO, emis- 
sions (see SM) and 60+ Gt emissions over 
30+ year life spans reduce recruitment in the 
southern Beaufort Sea by ~4% (see SM). 

Interpreting demographic impacts of in- 
dividual actions can be likewise confounded 
by propagation of uncertainties through 
our estimation process—resulting in wide 
interval estimates. We therefore cannot 
overemphasize the need to refine emissions 
estimates at all levels of reporting, as well 
as to fill the gap in measurements of body 
masses at which polar bears in most regions 
initiate summer fasts (0). 
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OTHER SPECIES AND SYSTEMS 

We focus on polar bears because the Bern- 
hardt memo (M-37017) was prompted by the 
ESA listing of polar bears and because we have 
the energetics data (J0) to make the quantita- 
tive link between emissions, sea ice, and po- 
lar bear demographics. But ramifications of 
our findings go far beyond polar bears and 
sea ice. In presenting this polar bear case his- 
tory, our outcomes show by example how the 
impact of emissions can be determined and 
included in Section 7 consultations for other 
species and habitats threatened by climate 
warming. For example, although we person- 
ally lack data, it is logical to believe that links 
between GHG emissions, sea level rise, and 
nesting habitat for marine turtles and beach 
nesting birds could be established. Also, it 
seems reasonable that similar regression 
links could be established between emissions, 
water temperature change, and impacts on 
both freshwater and marine species of plants 
and animals. We are confident that our polar 
bear example will lead other investigators to 
uncover parallels in their data, providing nu- 
merous other species with previously unavail- 
able ESA protections from climate warming. 
It is also reasonable that GHG emissions 
could be similarly linked to other phenom- 
ena—beyond the purview of the ESA. 

We have confirmed the direct link be- 
tween CO,-eq emissions and polar bear de- 
mography. In addition to aiding interpre- 
tation of observed population trends, this 
allows estimation of impacts from proposed 
GHG-emitting actions. Because the ESA re- 
quires following the best available science, 


the Department of the Interior now has the 
scientific justification and duty to rescind 
M-Opinion M-37017—allowing the US Fish 
and Wildlife Service to include anthropo- 
genic GHG emissions in Section 7 consulta- 
tions and meet its responsibility under the 
ESA to protect polar bears as well as other 
species imperiled by climate warming. This 
will provide the government an important 
and previously unavailable tool for address- 
ing anthropogenic climate warming. 

Small impacts of most individual actions 
emphasize that GHG emissions are the epito- 
mic cumulative impacts issue—requiring mit- 
igation at programmatic, regional, national, 
and international levels. We must remember, 
however, that the growth in cumulative GHG 
emissions by over 1750 Gt from 1979 to today 
happened largely one action at a time with- 
out regard for cumulative effects in the atmo- 
sphere or on the ground. In addition to broad- 
scale efforts, if future impact assessments 
require separate evaluations, decisions must 
be based on merit, contribution to cumulative 
impact, and the best available science. 

Rescinding M-37017 goes beyond con- 
sideration of polar bears and sea ice—ad- 
hering to President Biden’s mandate for a 
“Government-wide approach that reduces 
climate pollution in every sector of the econ- 
omy” (SM). The role of the Arctic sea ice sys- 
tem in moderating global climate means that 
protecting polar bears from anthropogenic 
climate warming also will benefit the rest of 
life on Earth—including humans. 
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Saving mothers with a simple act 


A compelling new play revisits the discovery that 
drastically reduced maternal mortality 


By Stuart Firestein 
mell!” shouts the strange, diminu- 
tive man emphatically from the edge 
of the stage at the beginning of the 
new play Dr Semmelweis. He repeats 
it several more times—to himself, to 
the audience, to the other characters, 
to anyone who will listen—as he moves up- 
stage. This mystifying pronouncement is a 
deliberate foreshadowing of the events that 
are about to unfold, as the audience jour- 
neys alongside the man—soon identified 
as Ignaz Semmelweis, a 19th-century Hun- 
garian physician whose simple discovery 
would eventually change medicine forever. 
Co-written by Stephen Brown and the 
esteemed British actor Mark Rylance, the 
play—which also stars Rylance in the title 
role—is, among its many other values, an 
excellent play about science. It is to Semmel- 
weis, after all, that we owe the beginnings 
of germ theory and the once-radical idea of 
washing our hands. 
Semmelweis speaks with just the hint of 
a stutter and an Eastern European accent. 
Despite his position as assistant to the di- 
rector of the Vienna General Hospital, he is 
an outsider and suffers daily frustrations, 
small and large, as a result. 
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Semmelweis is obsessed with find- 
ing possible causes of “childbed fever,” a 
deadly infection that plagues an alarming 
number of postpartum women who give 
birth in hospital obstetric wards. He dis- 
misses variables one by one, but one curi- 
ous fact remains: Maternal death rates in 
doctor-run obstetric wards are three times 
that of patients whose births are adminis- 
tered by midwives. The doctors, however, 
find excuse after excuse for this anomaly. 

A janitor who has been clean- 
ing up after autopsies in the 
background throughout the en- 


Dr Semmelweis 


Physician Ignaz Semmelweis (left; played by Mar’ eee 
Rylance) makes his important discovery. 


tors, the audience learns as the second act 
opens, is that only the doctors perform au- 
topsies, and they typically do so in the morn- 
ings before going to the maternity ward to 
deliver babies. Washing one’s hands between 
doing autopsies and delivering babies proves 
transformative: The incidence of childbed fe- 
ver is miraculously reduced from as high as 
10% of patients to less than 1%. 

Compelling as the data are, the medical 
establishment is not convinced. Physicians 
scoff at the idea that such a simple practice 
could make a difference, and they resent 
the implication that they are in some way 
contaminated. 

Madness descends on Semmelweis as 
frictions between the doctor and his col- ,. 
leagues come to the surface. He is overcome 
by thoughts of the women who have died 
and the women who will soon be dead if 
his revelation is not embraced immediately. 
He calls eminent surgeons “murderers” and 
“assassins.” Hospitals are “slaughterhouses.” 
Everyone who fails to recognize the impor- 
tance of his discovery is “stupid.” Semmel- 
weis’s outbursts and impatience make him 
easy to dismiss. Remember, the microorgan- 
isms responsible for the infections, which 
he calls “cadaveric particles,’ have not yet 
been discovered. Is it ego or urgency that 
drives Semmelweis? Rylance and the script 
blur the distinction. 

Semmelweis is ultimately committed to 
an insane asylum, driven there by an over- 
whelming sense of the tragedy of a woman 
giving birth and dying soon after. Ironically, 
he dies during his confinement from sepsis 
due to an unwashed wound. 

This is no simple historical 
retelling; all of the techniques of 


tire play offers a hint. “I’ll mop Stephen Brown with the theater are put to use. There 
up and kill the stink” he tells aia ie bis is music—four violins and a * 
Semmelweis. The smell! “How London, UK, ’ cello—that drives the action for- 
do you kill the smell?” the physi- through 7 October 2023 ward. There are dancers, mostly 


cian inquires. The janitor points 
to his bucket of chlorine bleach. Semmelweis 
plunges his hands into the bucket, the first 
act of hygienic handwashing, as the first half 
of the play comes to a close. An audible sigh 
fills the darkened theater, conveying the au- 
dience’s palpable relief for the thousands of 
women who will subsequently be saved by 
this simple action. 

During the intermission, I overheard 
a fellow patron remark, “But how could 
they not have been washing their hands? 
Doesn’t everyone just know that?” How 
easily we forget that what seems like com- 
mon sense is often the result of long and 
painstaking scientific work. 

The difference between midwives and doc- 


women, who embody the victims 
of childbed fever and heighten the show’s 
sense of urgency. At two and a half hours, the 
play is substantial, but Rylance’s Semmelweis 
is riveting throughout—charming and funny 
one moment, obsessive and abusive the next. 
Semmelweis is a tragic character—a brave 
scientist who embraced data above all else 
and the mostly forgotten pioneer who first 
made the connection between microbes 
and disease. Yet his story is ultimately a tri- 
umph. His unyielding vision led to a simple 
solution that has reduced maternal mortal- 
ity worldwide. His life, and this play, are a 
scientific story worth celebrating. 
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Coexistence at the crossroads 


Ideology meets infrastructure for road ecologists helping 
flora and fauna live well in the presence of motorways 


By Sarah Boon 


hen we think of “road ecology,’ most 

of us think of wildlife crossings— 

the overpasses, underpasses, and 

fences, designed by engineers and 

biologists to connect fragmented 

habitats, into which animals are 
funneled. But road ecology, a relatively new 
field of science, is about so much more than 
just wildlife crossings. 

In Crossings: How Road Ecology Is 
Shaping the Future of Our Planet, Ben 
Goldfarb explores the science of road ecol- 
ogy and its impacts on our world. “It’s really 
interesting to meet someone in college who 
says, ‘I’m a road ecologist,” Washington State 
biologist Paul Wagner tells Goldfarb. “[I]t’s 
like, huh, I remember when we made that 
term up [in the 1990s].” 

Since that time, the field of road ecology 
has expanded exponentially, as evidenced 
by the sheer number of experts Goldfarb 
interviews. The scientists, engineers, policy- 
makers, and First Nations representatives 
he talks to are working to help all manner of 
flora and fauna—from mammals, to reptiles, 
to fish, to plants, to insects, and even hu- 
mans—better coexist with roads. Goldfarb’s 
many field excursions with these experts help 
readers experience road ecology firsthand, re- 
vealing how it started and where it is headed. 

Goldfarb contends that roads are more 
than mere infrastructure, they are an ideol- 
ogy that promotes sprawl, champions car 
traffic, and—when they are built in the 
backcountry—encourages human-animal 
interactions. “Like abortion or critical race 
theory, roads [have] become shibboleths, 
entrenched cultural markers that [distin- 
guish] warring factions,” he writes. “The 
practice of road ecology is not merely a set 
of engineering principles but a moral man- 
date.” Road ecologists and everyday people 
who rehabilitate injured wildlife found 
on roads and shepherd migrating animals 
across roads answer this mandate. 

Roads are problematic in many ways: 
The number of cars they can support and 
the driving speeds they enable lead to wild- 
life deaths, and the noise pollution that 
often accompanies them changes animal 
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activity such as birdsong. Roads can lead 
to habitat fragmentation, which isolates 
individual animals from the rest of their 
population. Contaminants released from 
tires and exhaust also affect the near-road 
environment. Meanwhile, remote wilder- 
ness roads enable hunting access, and the 
increased contact between humans and 
animals when new roads are built into un- 
tracked wilderness can enhance the spread 
of zoonotic diseases. 

“For all of human history, roads have been 
instruments of conquest,’ writes Goldfarb, 
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tioned them from white ones,’ he writes. 
Meanwhile, road noise and pollutants dis- 
proportionately affect low-income commun- 
ities and racial minorities. 

But not everything about roads is bad. 
Goldfarb finds that road verges—vegetation 
next to roads—can help preserve endan- 
gered habitats, such as prairies. Verges can 
also provide insect habitats and are less of a 
barrier, as they provide habitat that is paral- 
lel, rather than perpendicular, to roads. This . 
reduces the likelihood of insects being killed 
by vehicle traffic. 


A box turtle undertakes a perilous road crossing in the state of Georgia. 


but the Confederated Salish and Kootenai 
Tribes believe that “the road is a visitor” that 
should “respond to and be respectful of the 
land and the Spirit of Place.” Culverts that 
prevent salmon from spawning upstream 
affect Indigenous people for whom salmon 
is part of their creation story. New roads 
built in the Amazon connect “undiscovered” 
Indigenous groups with the outside world, 
often without their consent. 

Many roads, notes Goldfarb, were osten- 
sibly built to combat “urban blight” in areas 
that were, in reality, vibrant minority com- 
munities. “Freeways were the erasers with 
which cities rubbed out Black communi- 
ties and the walls with which they parti- 


Roads are growing faster than any other 
human infrastructure, and although road 
ecologists cannot stop road building, they 
can push to be involved in road design. Some 
strategies that would help mitigate the dam- 
aging effects of roads include designing some 
to be driven on more slowly, closing some at 
night, and incorporating wildlife overpasses 
and underpasses as needed. 

Crossings is a deeply researched and com- 
pelling read, enlivened with interviews and 
on-the-ground fieldwork. The book offers 
readers a look behind the scenes of a rich but 
underappreciated field of study that has the 
potential to affect our everyday lives. & 
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Retraction 


On 9 March 2001, Science published the 
Report “Binding of DCC by netrin-1 to medi- 
ate axon guidance independent of adenosine 
A2B receptor activation” (7). In 2015, Science 
agreed to the publication of an Erratum to 
address issues that had come to our atten- 
tion, specifically: in Figures 1 and 3, beauti- 
fication of some Western blots outside the 
data-containing portions (which was not for- 
mally disallowed at the time of publication) 
and some image breaks caused by format- 
ting (“tiling”) issues, for which we provided 
corrected images. However, because of an 
error on the part of the journal, the Erratum 
was not posted. In 2022, when new concerns 
were raised, Science posted an Editorial 
Expression of Concern (2) while an insti- 
tutional investigation was conducted. The 
investigation is complete and has identified 
further issues, including manipulation of 
data-containing portions of Western blot 
images in Figures 1, A and C, and 3, B and 
D, undermining confidence in the paper’s 
conclusions (3). As a result, we are retract- 
ing the paper. We regret the impact of these 
issues on the scientific community. Author 
Elke Stein disagrees with the decision to 
retract the paper. 

Yimin Zou, Mu-Ming Poo?, Marc Tessier-Lavigne?* 
University of California San Diego, San Diego, CA, 
USA. Institute of Neuroscience, Chinese Academy 
of Sciences, Shanghai, China. ?Stanford University, 
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When China’s first solar panel installations reach the end of their life spans, the country will need a plan to sustainably process t 
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Retraction 


On 8 February 2001, Science published the 
Research Article “Hierarchical organiza- 
tion of guidance receptors: Silencing of 
netrin attraction by Slit through a Robo/ 
DCC receptor complex” (7). In 2015, 
Science agreed to the publication of an 
Erratum to address issues that had come 
to the authors’ attention, specifically: in 
Figures 4, 5, and 6, beautification of some 
Western blots outside the data-containing 
portions (which was not formally disal- 
lowed at the time of publication) and some 
image breaks caused by formatting (“til- 
ing”) issues, for which I provided corrected 
images, and panel duplications in Figures 
2D (micrographs) and 4B (blank Western 
blots), and an incorrect blank panel in 
Figure 5E. However, because of an error on 
the part of the journal, the Erratum was 
not posted. In 2022, when new concerns 
were raised, Science posted an Editorial 
Expression of Concern (2) while an insti- 
tutional investigation was conducted. The 
investigation is complete and has identified 
further issues, including manipulation of 
data-containing portions of Western blot 
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images in Figure 4E and an additional panel : 
duplication in Figure 5C, undermining con- 
fidence in the paper’s conclusions (3). As a 
result, I am retracting the paper. I regret the 
impact of these issues on the scientific com- 
munity. Author Elke Stein disagrees with 

the decision to retract the paper. 

Marc Tessier-Lavigne 
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Green decommissioning 
for renewables in China 


Carbon neutrality depends on the devel- 
opment of renewables such as wind and 
solar energy (1, 2). China is implementing 
ambitious renewable energy plans, with 
the goal of exceeding 1.2 billion kilowatts 
of total installed capacity of wind and 
solar energy by 2030 (3). However, wind 
and photovoltaic equipment have limited 
life spans. Wind turbines and photovoltaic 
modules installed in China around 2000 
will soon reach the end of their life of 20 
to 25 years. At that time, China will need 
to determine an environmentally respon- 
sible way to decommission them (4). 
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INSIGHTS 


By 2040, the cumulative scale of decom- 
missioned wind and solar modules in 
China is estimated to reach 280 and 250 
GW, respectively (4). The decommissioning 
of wind and photovoltaic equipment will 
generate a large amount of solid waste, 
including blades, glass, and semiconductor 
materials (5). By 2028, China’s decommis- 
sioned wind turbines will generate approx- 
imately 74,000 tons of solid waste (6), 
and by 2050, the discarded photovoltaic 
modules in China will reach 20 million 
tons (7). Disposing of this solid waste in 
improper ways, such as burial or incinera- 
tion, will waste resources and pollute the 
soil, groundwater, and atmosphere (4, 8). 

China plans to prevent waste and pol- 
lution by recycling decommissioned wind 
turbines and photovoltaic modules (3, 

9), but more comprehensive action is 
required. An integrated plan should clarify 
decommissioning programs and actions 
for wind and photovoltaic equipment at 
both national and local scales. The plan 
must involve all relevant parties, including 
industry, academic institutions, investment 
institutions, and local stakeholders. China 
should also establish the circular economy 
that will be required to recover and reuse 
decommissioned wind turbines and photo- 
voltaic modules (J0) as well as the techni- 
cal specifications, policies, and regulations 
that will be required (17). As China pre- 
pares to manage decommissioned equip- 
ment, the government should lead invest- 
ments in the research and development of 
new energy materials and extend the life 
cycle of wind turbines and photovoltaic 
modules through technological innovation 
and green retrofitting. 

China’s wind and photovoltaic decommis- 
sioning will be concentrated in ecologically 
fragile areas (17). In addition to clarifying 
the logistics of retiring equipment, the coun- 
try should prioritize in-depth assessments 
of the impacts of decommissioned wind 
and photovoltaic equipment on ecosystems. 
If the environment suffers adverse effects, 
China should adopt management practices 
such as ecological restoration to minimize 
them. China can maximize the benefits of 
wind and solar energy by preparing green 
and sustainable decommissioning plans. 
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Strengthen management 
of offshore aquaculture 


China is accelerating toward deeper 
offshore aquaculture (7) in an effort to 
diversify the food supply system to ensure 
domestic food security in the face of 
unstable international markets. However, 
offshore aquaculture presents risks that 
differ from those in nearshore fisheries (2, 
3). The government and the aquaculture 
industry in China need to identify and 
address the unique challenges of offshore 
farms before expanding into deep water. 

As of 2022, China had 20,744 km? of 
marine aquaculture cover (4). Nearshore 
aquaculture, a mature and well-regulated 
industry (5), takes place in enclosed or 
semi-enclosed waters that are less than 20 
m deep and uses cages, rafts, and bottom- 
seeding to produce the desired products. 
In 2022, nearshore aquaculture produced 
more than 22 million metric tons of fish, 
shellfish, and aquatic plants (4). However, 
nearshore sites are fast approaching the 
farming capacity limit (6) as they com- 
pete for space with recreational fishing 
and tourism and suffer from pollution 
(5). Offshore farms have been proposed 
because they have better water quality and 
larger farming capacity (7). 

Offshore aquaculture takes place in 
open water deeper than 20 m and is domi- 
nated by cage culture. In 2022, offshore 
aquaculture production totaled just 0.4 
million tons (4). Still in its infancy, the 


offshore industry lacks efficient produc- 
tion chains and market mechanisms (7). 
Offshore farms require sophisticated 
infrastructure and equipment (2) and are 
more vulnerable than nearshore sites to 
natural disasters such as typhoons (3). 
Many nearshore sites produce introduced 
nonnative and migratory native species 
(8, 9), but farming such species in 
open water could pose risks that are 
not well understood. 

In June, China developed its first 
guiding opinion on the development 
of deeper offshore aquaculture (J0), 
but current policies focus on nearshore 
activities and may be insufficient for 
offshore farms. Before expanding off- 
shore aquaculture, China should refine 
regulations for managing ecological and 
environmental protection, financial credit, . 
policy-based insurance, and the selec- 
tion of aquaculture species. Additional 
research should be funded to establish 
environmentally safe offshore infrastruc- 
ture (3) and production procedures spe- 
cific to the offshore industry (10). National 
and regional communication and coopera- 
tion should promote the integrated devel- 
opment of the offshore production chain 
by sharing scientific, technological, and 
human resources. 
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Artist’s depiction of a black 
hole, with an associated 
star and accretion disk, 
outputting light at 

x-ray, optical, and radio 
wavelengths 


Magnetic field halts accretion 


s material falls toward a black hole, it forms an accretion disk that emits x-rays and optical 

light and sometimes also a jet that is visible at radio wavelengths. Theory predicts that if the 

disk contains a sufficiently strong magnetic field, then it can resist the gravitational pull of the 

black hole and temporarily halt the accretion process. You et al. compared observations of a 

transient black hole accretion event at x-ray, optical, and radio wavelengths. They found time 
delays between brightening at the different wavelengths and used models to show that this behav- 
ior is produced by a magnetically arrested disk. —KTS 
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is one of the most widely used often requires a trial-and-error 
A carbon-nitrogen reactions in pharmaceutical process to identify pertinent 
2 P research and manufacturing. optimal conditions. Rinehart 
coupling guide Nonetheless, it depends sensi- et al. trained and validated a 
The palladium-catalyzed cou- tively on the structure of the two machine learning model to pre- 
pling of amines with aryl halides coupling partners and therefore dict appropriate ligand, solvent, 
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lar reactant pairs. Ten products 
were isolated in more than 85% 
yield under the individualized 
conditions predicted by the 
model. —JSY 
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Two powerful quakes 
The Kahramanmaras earth- 
quake sequence in Turkey on 
6 February 2023 caused a 
tremendous amount of damage 
and loss of life. The sequence 
occurred across several faults, 
including and associated with 
the East Anatolian Fault, a 
strike-slip fault that has had : 
many major earthquakes in the 
past. Jia et al. used an array of 
geophysical observations to 
produce models of how the rup- 
tures occurred. The earthquake 
sequence ruptured at least six 
faults, including a large portion 
of the East Anatolian Fault. The 
rupture sequence was complex 
and contained surprises in 
the details of how the rupture 
occurred. These observations 
and models are important 
for understanding strike-slip 
faults and forecasting seismic 
hazards. —BG 

Science, adi0685, this issue p. 985 


Improving student ( 
success with engagement 


Across US universities, calculus 
is a gateway course for STEM 
degrees. Of all students who 
initially pursue STEM degrees, 
more than half graduate 
without one, often after strug- 
gling through coursework. 
Instructors defaulting to tradi- 
tional lecture-based instruction 
exacerbates disparities in 
failure rates; this disproportion- 
ately affects women, Hispanic, 
and Black students, depriving 
the workforce of talent and 
insights from diverse groups. 
Kramer et al. conducted a large 
trial that randomized students 
into calculus classrooms where 
instructors actively engaged 
students collaboratively (treat- 
ment) or relied on traditional 
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lecture styles that treated them 
as passive learners (control). 
Across demographic groups, 
the treatment was more effec- 
tive, as engagement fostered 
a deeper understanding of 
calculus, improved grades, 
and promoted the inclusion of 
underrepresented students. 
—EEU 
Science, ade9803, this issue p. 995 


A perceptual space 
map for smells 


For vision and hearing, there 
are well-developed maps that 
relate physical properties such 
as frequency and wavelength 
to perceptual properties such 
as pitch and color. The sense of 
olfaction does not yet have such 
a map. Using a graph neural 
network, Lee et al. developed 
a principal odor map (POM) 
that faithfully represents known 
perceptual hierarchies and dis- 
tances. This map outperforms 
previously published models 
to the point that replacing a 
trained human’s responses 
with the model output would 
improve overall panel descrip- 
tion. The POM coordinates were 
able to predict odor intensity 
and perceptual similarity, 
even though these perceptual 
features were not explicitly part 
of the model training. These 
results were used to build a vari- 
ety of olfactory predictions that 
outperformed previous feature 
sets even without fine-tuning. 
—PRS 

Science, ade4401, this issue p. 999 


Lazily sensing gravity 
The growth of plants in 
response to gravity relies on 
the sedimentation of starch 
granules called amyloplasts. 
Transduction of the sedimen- 
tation signal relies on both 
LAZY1-LIKE (LZY) proteins and 
redirection of auxin transport, 
but the links between each of 
these processes have not been 
fully established. Nishimura et 
al. found that LZY proteins at 
the plasma membrane relocate 
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to the amyloplast settling sites 
when the gravity direction is 
altered, indicating that these 
proteins transmit informa- 
tion about gravity direction 
to downstream signaling 
components. The experiments 
support the idea that gravi- 
sensing is carried out through a 
positional sensing mechanism 
rather than by sensing force. 
—MRS 

Science, adh9978, this issue p. 1006 


Alcohols keep the 
chains long 


Mass production of silicones 
proceeds by the steady growth 
of silicon—oxygen chains. 
However, there is a small 
fraction of these chains that 
inevitably bite back on them- 
selves late in the reaction to 
produce cyclic impurities. Shi 
et al. report that benzyl alcohol 
can complex with the chain end 
through hydrogen bonding and 
inhibit the back-biting process. 
Moreover, they describe a 
phosphonium counterion that 
is also stabilized by the alcohol 
but decomposes in its absence 
to deactivate chain growth, 
ikewise preventing the by- 
product formation. —JSY 
Science, adi1342, this issue p. 1011 


My oh myeloid 
Myeloid cell populations 
control immunosuppression 
in the tumor microen- 
vironment (TME), and 
targeting the TME is a promis- 
ing therapeutic strategy for 
immunosuppressed tumors. 
Juric et al. developed an 
afucosylated humanized 
monoclonal antibody against 
a proinflammatory receptor 
expressed on many myeloid 
cells. Treatment of a synge- 
neic mouse model with this 
antibody promoted anti- 
tumor efficacy, suggesting a 
promising strategy for future 
immunotherapy that requires 
further study. —DLH 
Sci. Transl. Med. (2023) 
10.1126/scitransImed.add9990 
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Segmented and reconstructed 


egmentation allows a visual system to distinguish 

which regions of a visual scene belong to the back- 

ground, which regions belong to different objects, 

and where the borders that constrain the regions of 

these objects are located. The brain uses a diversity of 
cues, including luminance, texture, disparity, and motion, to 
segregate objects from the background. Working with rhesus 
macaques, Hesse and Tsao used electrophysiology guided 
by functional magnetic resonance imaging to investigate 
whether the visual cortex has a functional organization for 
segmentation. They found that the visual cortex contains 
discrete modules spanning visual areas V2, V3, V3A, V4, 
and V4A for segmenting visual scenes. These segmentation 
modules are functionally distinct from both color-selective 
regions and other neighboring regions. —PRS 


Proc. Natl. Acad. Sci. U.S.A. (2023) 10.1073/pnas.2221122120 


Functional magnetic resonance imaging scan showing 


activation of the human visual cortex 


Adrenergic nerves and 


intestinal regeneration 
The intestine is a highly regen- 
erative organ. The epithelial 
lining serves as a barrier against 
pathogens, and its cells have a 
high turnover rate, consistently 
responding to wear and tear to 
regenerate. Although peripheral 
erves innervate the intestine, 


lial cell repair is currently unclear. 
Wang et al. studied adrenergic 
nerves and their influence on 

the renewal and regeneration 

of intestinal epithelium. Using 

a mouse model of irradiation- 
induced injury, the authors 
found that the density of gut 
adrenergic nerves increased after 
irradiation. Intestinal regenera- 
tion occurred by a neuroimmune 
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cross-talk mechanism, in which 
-adrenergic receptor signaling 
promoted interleukin-22 
production from type 3 innate 
lymphoid cells. —PNK 
Cell Stem Cell (2023) 
10.1016/j.stem.2023.07.013 


Vicarious shocks to 
mental health 


Can learning about or watch- 

ing someone else in your social 
network experience racism 
undermine your own mental 
health? A growing literature sug- 
gests that vicarious racism could 
be as impactful as direct racism, 
but we need to understand its 
prevalence and social patterning. 
Quinn et al. parsed the specific 
contributions of social contexts 


957 


RESEARCH 


ALSO IN SCIENCE JOURNALS siadbyaiieealbuck 


HEALTH 
What causes obesity? 


The incidence of obesity is 
increasing substantially. Obesity 
is recognized as a medical 
condition and as a risk factor for 
various other diseases. There 
are clearly genetic influences on 
the development of obesity, as 
well as environmental factors 
such as the amount and type 
of food intake. In a Perspective, 
Speakman et al. argue that we 
should approach obesity as 
arising from a combination of 
genes and environment rather 
than genes versus environ- 
ment. Considering these factors 
together highlights the mul- 
tifactorial influences on the 
development of obesity and 
raises many questions about 
its causes, which need to be 
addressed to better prevent and 
treat this disease. —GKA 
Science, adg2718, this issue p. 944 


ATHEROSCLEROSIS 
A protein that helps 


prevent heart disease 
Apolipoprotein B (apoB) is 
a protein that carries lipids 
and cholesterol. Lipoproteins 
composed of apoB and its lipid 
cargo play a key role in the 
pathogenesis of atherosclerosis. 
Tissue plasminogen activator 
(tPA), an anticoagulant protein 
that can be used to break down a 
blood clot causing a heart attack 
or stroke, inversely correlates 
with the amount of cholesterol- 
loaded apoB lipoproteins, but it 
was not clear why this inverse 
correlation exists. Dai et al. dis- 
covered that in both mouse and 
human cells, tPA produced by 
the liver binds directly to apoB, 
preventing it from being loaded 
with lipids and assembled into 
lipoproteins. Conversely, an 
inhibitor of tPA has the opposite 
effect, binding up free tPA and 
preventing its interaction with 
apoB. —YN 

Science, adh5207, this issue p. 959 
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PLANT MITOCHONDRIA 
Initiating protein 
translation 


Mitochondrial protein transla- 
tion machinery has diverged 
within groups of eukaryotes, 
with differing ribosome struc- 
tures and initiation sites. Tran 
et al. found that two proteins, 
mTRANI1 and mTRAN2, are 
required for translation initiation 
in Arabidopsis mitochondria. The 
mTRAN proteins form part of the 
ribosome small subunit and bind 
to motifs in the 5’ untranslated 
regions of mitochondrial genes, 
allowing protein translation to 
begin. This work offers insight 
into translation initiation in 
plants and highlights how dif- 
ferent mitochondrial translation 
mechanisms have evolved from 
acommon bacterial ancestor. 
—MRS 

Science, adg0995, this issue p. 960 


ARCTIC CLIMATE 
Going with the flow 


One of the reasons that Arctic 
sea ice has been disappearing 
over the past decades is that 
warm water from the Atlantic 
is being advected into the 
high-latitude ocean in increas- 
ing amounts, a process called 
“atlantification.” But what 
drives this process? Polyakov 
et al. show that the large-scale 
weather pattern called the Arctic 
Dipole causes atmospheric wind 
patterns that modulate North 
Atlantic inflows across the Fram 
Strait and within the Barents 
Sea, resulting in variations in 
Arctic Ocean circulation, fresh- 
water fluxes into the Amerasian 
Basin, ocean stratification, and 
heat fluxes (see the Perspective 
by Bacon). —HJS 

Science, adh5158, this issue p. 972; 

see also adj8469, p. 946 


HUMAN EVOLUTION 
Aclose call 


Today, there are more than 8 bil- 
lion human beings on the planet. 
We dominate Earth's landscapes, 
and our activities are driving 
large numbers of other species 
to extinction. Had a researcher 
looked at the world sometime 
between 800,000 and 900,000 
years ago, however, the picture 
would have been quite different. 
Hu et al. used a newly developed 
coalescent model to predict 
past human population sizes 
from more than 3000 present- 
day human genomes (see the 
Perspective by Ashton and 
Stringer). The model detected 
a reduction in the population 
size of our ancestors from about 
100,000 to about 1000 individu- 
als, which persisted for about 
100,000 years. The decline 
appears to have coincided with 
both major climate change and 
subsequent speciation events. 
—SNV 

Science, abq7487 this issue p. 979; 

see also adj9484, p.947 


EVOLUTIONARY BIOLOGY 
Not so different after all 


The rate of genetic mutation 
that occurs across generations 
is regularly used to estimate a 
wide array of measures, includ- 
ing past population sizes. It also 
varies depending on a suite of 
considerations, including gener- 
ation time and body size, and is 
notoriously difficult to estimate 
in wild animal species. Suarez- 
Menéndez et al. used a pedigree 
approach in four wild baleen 
whale species, producing a 
mutation rate different from that 
estimated using a phylogenetic 
approach (see the Perspective 
by Hoelzel and Lynch). The 
new rate is faster than previ- 
ously estimated for these large 
animals, being more consistent 
with primates and other smaller 
species with similar generation 
times. —SNV 

Science, adf2160, this issue p. 990; 

see also adk0121, p.942 
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CANCER STEM CELLS 
Breaking the cycle in 
leukemic stem cells 


Leukemic stem cells are 
resistant to various therapies 
and promote relapse in many 
chronic and acute leukemias. 
aef et al. identified a pathway 
that was necessary for stemness 
in leukemic stem cells but not 

in normal hematopoietic stem 
cells. Fusion proteins that drive 
eukemias promoted leukemic 
stem cell proliferation through a 
cell-autonomous loop involv- ‘ 
ing Wnt signaling and ST2, a 
receptor for the cytokine inter- 
eukin-33. ST2 was not detected 
on normal hematopoietic stem 
cells, indicating a potentially 
cell-targeted approach for more 
durable treatment outcomes in 
patients. —LKF 


Sci. Signal. (2021) 
10.1126/scisignal.add7705 


© 


T CELLS 
Replenishment of self- 


renewing T cells 


Stem-like CD8 T cells develop 
during periods of persistent 
antigen exposure. During chronic 
infection and cancer, therapies 
can stimulate the differentiation 
of antigen-specific stem-like 
CD8 T cells into cells with effec- 
tor function. However, factors 
controlling the integrity of the 
stem-like T cell pool have been 
unknown. Studying chronic viral 
infection in mice, Humblin et al. 
found that the costimulatory 
molecule CD28 regulated the 
metabolism of stem-like CD8 T 
cells expressing the inhibitory 
receptor PD-1 to promote self- 
renewal and differentiation. In 
companion work, Gill et a/. found 
that anti-PD-1 blockade, while 
promoting effector cell differen- 
tiation, likewise stimulated the 
self-renewal of these stem-like 
T cells, maintaining the size and 
functionality of the progenitor 
pool. Thus, immune check- 
point therapies may rejuvenate 
immune responses without 
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diminishing the durability of the 
stem-like T cells that respond to 
treatment. —SHR 


Sci. Immunol. (2023) 


10.1126/sciimmunol.adg0878, 
10.1126/sciimmunol.adg0539 


PHYSICAL CHEMISTRY 
Attochemistry of 
nuclear motion 


Photoionization by absorption 
of an extreme ultraviolet photon 
can result in ultrafast nuclear 
motion in the resulting cationic 
molecule. In joint experimental 
and theoretical work, Ertel et 

al. investigated how nuclear 
dynamics affect the outcome of 
isotopically different chemical 
reactions by steering the elec- 
tronic dynamics triggered by an 
attosecond pulse. The authors 
used methane containing hydro- 
gen (CH,) or deuterium (CD,), 
which present different nuclear 
dynamics after the absorption 
of an extreme ultraviolet photon. 
They suggest that the physical 
origin of the differences can 

be traced back to the nuclear 
motion or the nuclear autocor- 
relation function of a molecule 
after the absorption. -SMK 
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Sci. Adv. (2023) 
10.1126/sciadv.adh7747 
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lecture styles that treated them 
as passive learners (control). 
Across demographic groups, 
the treatment was more effec- 
tive, as engagement fostered 
a deeper understanding of 
calculus, improved grades, 
and promoted the inclusion of 
underrepresented students. 
—EEU 
Science, ade9803, this issue p. 995 


A perceptual space 
map for smells 


For vision and hearing, there 
are well-developed maps that 
relate physical properties such 
as frequency and wavelength 
to perceptual properties such 
as pitch and color. The sense of 
olfaction does not yet have such 
a map. Using a graph neural 
network, Lee et al. developed 
a principal odor map (POM) 
that faithfully represents known 
perceptual hierarchies and dis- 
tances. This map outperforms 
previously published models 
to the point that replacing a 
trained human’s responses 
with the model output would 
improve overall panel descrip- 
tion. The POM coordinates were 
able to predict odor intensity 
and perceptual similarity, 
even though these perceptual 
features were not explicitly part 
of the model training. These 
results were used to build a vari- 
ety of olfactory predictions that 
outperformed previous feature 
sets even without fine-tuning. 
—PRS 

Science, ade4401, this issue p. 999 


Lazily sensing gravity 
The growth of plants in 
response to gravity relies on 
the sedimentation of starch 
granules called amyloplasts. 
Transduction of the sedimen- 
tation signal relies on both 
LAZY1-LIKE (LZY) proteins and 
redirection of auxin transport, 
but the links between each of 
these processes have not been 
fully established. Nishimura et 
al. found that LZY proteins at 
the plasma membrane relocate 
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to the amyloplast settling sites 
when the gravity direction is 
altered, indicating that these 
proteins transmit informa- 
tion about gravity direction 
to downstream signaling 
components. The experiments 
support the idea that gravi- 
sensing is carried out through a 
positional sensing mechanism 
rather than by sensing force. 
—MRS 

Science, adh9978, this issue p. 1006 


Alcohols keep the 
chains long 


Mass production of silicones 
proceeds by the steady growth 
of silicon—oxygen chains. 
However, there is a small 
fraction of these chains that 
inevitably bite back on them- 
selves late in the reaction to 
produce cyclic impurities. Shi 
et al. report that benzyl alcohol 
can complex with the chain end 
through hydrogen bonding and 
inhibit the back-biting process. 
Moreover, they describe a 
phosphonium counterion that 
is also stabilized by the alcohol 
but decomposes in its absence 
to deactivate chain growth, 
ikewise preventing the by- 
product formation. —JSY 
Science, adi1342, this issue p. 1011 


My oh myeloid 
Myeloid cell populations 
control immunosuppression 
in the tumor microen- 
vironment (TME), and 
targeting the TME is a promis- 
ing therapeutic strategy for 
immunosuppressed tumors. 
Juric et al. developed an 
afucosylated humanized 
monoclonal antibody against 
a proinflammatory receptor 
expressed on many myeloid 
cells. Treatment of a synge- 
neic mouse model with this 
antibody promoted anti- 
tumor efficacy, suggesting a 
promising strategy for future 
immunotherapy that requires 
further study. —DLH 
Sci. Transl. Med. (2023) 
10.1126/scitransImed.add9990 
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Segmented and reconstructed 


egmentation allows a visual system to distinguish 

which regions of a visual scene belong to the back- 

ground, which regions belong to different objects, 

and where the borders that constrain the regions of 

these objects are located. The brain uses a diversity of 
cues, including luminance, texture, disparity, and motion, to 
segregate objects from the background. Working with rhesus 
macaques, Hesse and Tsao used electrophysiology guided 
by functional magnetic resonance imaging to investigate 
whether the visual cortex has a functional organization for 
segmentation. They found that the visual cortex contains 
discrete modules spanning visual areas V2, V3, V3A, V4, 
and V4A for segmenting visual scenes. These segmentation 
modules are functionally distinct from both color-selective 
regions and other neighboring regions. —PRS 


Proc. Natl. Acad. Sci. U.S.A. (2023) 10.1073/pnas.2221122120 


Functional magnetic resonance imaging scan showing 


activation of the human visual cortex 


Adrenergic nerves and 


intestinal regeneration 
The intestine is a highly regen- 
erative organ. The epithelial 
lining serves as a barrier against 
pathogens, and its cells have a 
high turnover rate, consistently 
responding to wear and tear to 
regenerate. Although peripheral 
erves innervate the intestine, 


lial cell repair is currently unclear. 
Wang et al. studied adrenergic 
nerves and their influence on 

the renewal and regeneration 

of intestinal epithelium. Using 

a mouse model of irradiation- 
induced injury, the authors 
found that the density of gut 
adrenergic nerves increased after 
irradiation. Intestinal regenera- 
tion occurred by a neuroimmune 
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cross-talk mechanism, in which 
-adrenergic receptor signaling 
promoted interleukin-22 
production from type 3 innate 
lymphoid cells. —PNK 
Cell Stem Cell (2023) 
10.1016/j.stem.2023.07.013 


Vicarious shocks to 
mental health 


Can learning about or watch- 

ing someone else in your social 
network experience racism 
undermine your own mental 
health? A growing literature sug- 
gests that vicarious racism could 
be as impactful as direct racism, 
but we need to understand its 
prevalence and social patterning. 
Quinn et al. parsed the specific 
contributions of social contexts 
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EXTREME RAINFALL 
Rising up all over 


esoscale convective systems, large, organized, long-lasting, and prodigiously precipitating events, occur 
worldwide and are responsible for major fractions of the rainfall in the areas where they take place. Li et al. 
report that these systems have become more frequent and intense in the East Asian rainband, accounting for 
most of the increase in rainfall that the region has experienced in the past 20 years. The authors attribute 
this pattern to anthropogenic temperature increases and expect it to continue as the climate continues to 
warm, with the consequent impacts on events such as extreme flooding. —HJS 


Geophys. Res. Lett. (2023) 10.1029/2023GL103595 


A mesoscale convective system over 
the South China Sea as seen from the 
International Space Station 


and demographic characteristics 
among African American adults. 
Findings differed by gender, and 
vicarious discrimination was more 
prevalent than direct discrimina- 
tion, particularly among educated 
groups. Compared with women, 
men reported higher levels of 
both vicarious and direct discrimi- 
nation, particularly in the context 
of police encounters. Women 
reported more direct discrimina- 
tion in their workplaces and high 
vicarious discrimination related 
to police. Therefore, research 
may be neglecting an important 
contributor to poor health for this 
stigmatized group. —EEU 
Soc. Sci. Med. (2023) 
10.1016/j.socscimed.2023.116104 


SIGNALING 
Protein degradation and 
transcription 


Metazoan development requires 
the coordination of protein 
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complexes that differ widely in 
stability and persistence. The E3 
ubiquitin ligase UBR5 is a mem- 
ber of a family of enzymes that 
target specific proteins for pro- 
teolysis and plays an important 
role in regulating gene expression 
in human stem cells. Mark et al. 
found that UBR5 monitors the 
dissociation of multiprotein tran- 
scription factor complexes that 
serve as hubs or networks con- 
trolling cell functions. In a network 
centered on the c-Myc oncopro- 
tein, dissociation of c-Myc and 
other subunits exposed motifs 
that were recognized by UBRS5, 
leading to subunit degradation 
at different rates. Such targeted 
degradation by UBR5 forces 
cycles of complex reformation 
that support dynamic regula- 
tion of transcription and ensure 
that stem cells are responsive 
to changes in their environment. 
—LBR 
Cell (2023) 
10.1016/j.cell.2023.06.015 
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PHYSICS 
Looking for bound 
magnons 


Floquet engineering, which relies 
on applying a judiciously chosen 
periodic series of pulses, has 
been used to generate various 
spin interactions in cold atom 
systems. Kranzl et al. used this 
method to create a long-range 
anisotropic Heisenberg model in 
an array of 20 trapped calcium 
ions. The researchers then 
searched for bound magnon 
excitations in this system, start- 
ing from a state in which two 
adjacent spins were “flipped” 
with respect to the rest. They 
found that bound magnons 
propagate with a finite velocity 
that depends on the interac- 
tion strength. For even stronger 
interactions, it may be possible 
to create multiple bound states 
using this technique. —JS 
Phys. Rev. X (2023) 
10.1103/PhysRevxX.13.031017 


GLYCOSYLATION 


Sugar-coated methylation 
O-linked glycosylation and 
methylation are dynamic post- 
translational modifications of 
proteins and DNA, respectively, 
that enable cells to encode an 
additional level of information 
in response to external stimuli. 
Shin et al. explored an intriguing 
link between these processes. 
In human cells and mouse liver 
samples, the DNA methyltrans- 
ferase DNMT-1 is glycosylated 
in response to elevated glucose. 
This modification reduces DNA 
methyltransferase activity and 
results in a global loss of DNA 
methylation, particularly at sites 
known as partially methylated 
domains. Using a mutant that 
is not subject to this regulatory 
mechanism, the authors showed 
that the loss of methylation under 
high-glucose conditions is detri- 
mental to DNA stability. -MAF 
eLife (2023) 10.7554/eLife.85595 
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PLANT MITOCHONDRIA 


An mTRAN-mRNA interaction mediates mitochondrial 
translation initiation in plants 


Huy Cuong Tran, Vivian Schmitt, Sbatie Lama, Chuande Wang, Alexandra Launay-Avon, Katja Bernfur, 
Kristin Sultan, Kasim Khan, Véronique Brunaud, Arnaud Liehrmann, Benoit Castandet, Fredrik Levander, 
Allan G. Rasmusson, Hakim Mireau, Etienne Delannoy, Olivier Van Aken* 


INTRODUCTION: Mitochondria are eukaryotic 
organelles required for energy conversion and 
metabolism. As mitochondria are likely rem- 
nants of bacteria that were once incorporated 
through endosymbiosis, they have retained 
many features similar to their bacterial an- 
cestors. Mitochondria in most eukaryotes have 
retained their own genome, which encodes a 
set of proteins required for mitochondrial ope- 
ration. During evolution many mitochondrial 
components have, however, diverged substan- 
tially between eukaryotic kingdoms, of which 
the mitochondrial ribosome (mitoribosome) is 
a prime example. How mitochondrial mRNAs 
are recognized by mitoribosomes to initiate 
protein translation in plants has been an 
enigma, especially because mitochondrial 
mRNAs lack bacterial-type Shine-Dalgarno 
ribosome-binding sequences. 


RATIONALE: We characterized two “proteins of 
unknown function” that were suggested to be 
part of plant mitoribosomes, using Arabidopsis 
thaliana (Arabidopsis) as a model system. To 
assess their functions, we produced plants with 
reduced expression of these proteins and 
assessed how their loss of function affected 
mitochondrial biogenesis and proteostasis. In 
addition, we searched the 5’ regions of mito- 
chondrial mRNAs for conserved motifs that 
could act as potential mitoribosomal bind- 
ing sites. 


RESULTS: We found that the identified genes, 
which we named “mitochondrial translation 
factors” mTRANI and mTRAN2, are conserved 
in land plants but absent in green algae, ani- 
mals, and fungi. mTRANs were exclusively 
found inside mitochondria, and when m7TRANI 
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mTRAN proteins mediate mitochondrial translation in plants. Plant mitochondria are required for 
energy metabolism, growth, and survival. Several proteins of the mitochondrial respiration chain and Adenosine 
triphosphate (ATP)-synthase are encoded by the mitochondrial genome and translated by mitoribosomes. 
Plant-specific mTRAN proteins are part of the plant mitoribosome “small” subunit (mtSSU). Our results indicate 
that mTRANs likely recognize A/U-rich sequence motifs in the 5' regions of mitochondrial mRNAs to mediate 
translation initiation. After mRNA-binding, the “large” mitoribosomal subunit (mtLSU) joins to start translation. fMet, 
N-formylmethionine; A, adenosine; U, uridine; G, guanine; C, cytosine. 


Tran et al., Science 881, 960 (2023) 


1 September 2023 


the plants showed growth reductions range 
from moderate growth inhibition to embryo 
lethality, indicating that they are essential 
for normal plant development. In the mtran 
double mutants, mitochondrial biogenesis and 
function were impaired, with reduced abun- 
dance and activity of all oxidative phosphoryl- 
ation complexes that contain mitochondrially 
encoded subunits (Complexes I, III, IV, and V). 
Using coimmunoprecipitation assays, we 
found that mTRANI and -2 are part of the 
plant mitoribosomal “small” subunit (mtSSU), 
in line with previous studies. Transcriptome 
analysis of mtran double mutants showed a 
rearrangement of nuclear, mitochondrial, and 
chloroplast gene expression that further sug- 
gested mitochondrial translation defects. In 
agreement, we showed that mtran double 
mutants have reduced mitochondrial protein _ 
synthesis rates. A large proportion of mitochon- 
drial mRNAs were not bound by mitoribo- 
somes in mtran double mutants, suggesting 
an important role for mTRANs in translation 
initiation. Structural modeling indicated that 
mTRAN proteins are alpha-solenoid proteins 
related to pentatricopeptide repeat (PPR) pro- 
teins and are thus potentially RNA-binding 
proteins. Because we could not find putative 
RNA motifs that could be bound by mTRANs 
using classical “PPR-code” detection tools, we 
searched the 5’ sequences of thousands of 
plant mitochondrial genes for conserved motifs. 
We identified A/U-rich motifs (CUUUxU and 
AAGAAx/AxAAAG) and showed that mTRANI1 
can directly bind these motifs in vitro and 
in vivo. Finally, using ribosome footprinting 
(Ribo-seq), we showed that mTRANs are re- 
quired for binding and translation of likely all 
plant mitochondrial mRNAs. 


CONCLUSION: This study establishes mTRAN 
proteins as factors required for mitochon- 
drial translation in plants. Our results in- 


dicate that mTRANs can bind conserved - 


A/U-rich motifs present in the 5’ regions 
of plant mitochondrial mRNAs, which may 
act as ribosome binding sites. mTRANs may 
thus act as “universal” homing factors to 
guide the mitoribosome to mitochondrial 
mRNAs and initiate translation. Plant mito- 
chondrial translation initiation therefore 
appears to use a protein-mRNA interaction 
that is divergent from bacteria or mam- 
malian mitochondria. 
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An mTRAN-mRNA interaction mediates mitochondrial 
translation initiation in plants 
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Plant mitochondria represent the largest group of respiring organelles on the planet. Plant 
mitochondrial messenger RNAs (mRNAs) lack Shine-Dalgarno-like ribosome-binding sites, so it is 
unknown how plant mitoribosomes recognize mRNA. We show that “mitochondrial translation 
factors” mTRAN1 and mTRANZ are land plant-specific proteins, required for normal mitochondrial 
respiration chain biogenesis. Our studies suggest that mTRANs are noncanonical pentatricopeptide 
repeat (PPR)-like RNA binding proteins of the mitoribosomal “small” subunit. We identified 
conserved Adenosine (A)/Uridine (U)-rich motifs in the 5’ regions of plant mitochondrial mRNAs. 
mTRAN1 binds this motif, suggesting that it is a mitoribosome homing factor to identify mRNAs. We 
demonstrate that mTRANs are likely required for translation of all plant mitochondrial mRNAs. 
Plant mitochondrial translation initiation thus appears to use a protein-mRNA interaction that is 
divergent from bacteria or mammalian mitochondria. 


Ithough most mitochondrial proteins are 
nuclear encoded, mitochondria have par- 
tially retained their genome and transla- 
tional machinery (J, 2). In plants, the 
number of unique mitochondrial proteins 
is estimated around 1000 to more than 2000 
(2, 3), but the mitochondrial genome only 
encodes 20 to 40 proteins (4, 5). Despite a bac- 
terial origin, mitochondrial translation is sub- 
stantially different from its bacterial counterpart 
(6-8). In yeast, mitochondrial translation in- 
itiation is regulated by general translational 
factors associated with the mitoribosome and 
mRNA-specific factors that bind 5’-untranslated 
regions (5' UTRs) to position translation ini- 
tiation sites (9, 10). Mammalian mitochon- 
drial mRNAs have absent or extremely short 
5' UTRs, and translation can start directly on 
non-AUG (adenine-uracil-guanine) start codons 
(I, 12). 
Bacterial-type Shine-Dalgarno ribosome- 
binding sequences are not found in flowering 
plant mitochondrial 5’ UTRs (13). Therefore, 
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there is poor understanding of how plant mito- 
ribosomes interact with 5’ UTRs of mitochon- 
drial mRNAs and how mitoribosomal small 
subunits (mtSSU) identify correct translation 
start codons. In contrast to other species, the 
plant mtSSU is larger than the mitoribosomal 
large subunit (mtLSU) (/4, 15). The plant mito- 
ribosome structure and core components have 
recently been studied (74-16), showing that 
plant-specific ribosomal pentatricopeptide re- 
peat (rPPR) proteins have become bona fide 
ribosomal constituents. PPR proteins contain 
repeats of 35 amino-acid tandem motifs, each 
forming two antiparallel «-helices, which in- 
teract with each other to generate a helix-turn- 
helix motif (17, 78). The helix-turn-helix PPR 
domains form a superhelix with a central 
groove, allowing PPR proteins to interact with 
RNA (i7, 19). 

We characterized Arabidopsis thaliana genes 
AT4G15640 and_AT3G21465—currently annotated 
as adenylyl cyclases (ACs), but also suggested to 
be mitochondrial ribosome components 7PPRIO/ 
mS83 (ribosomal pentatricopeptide repeat protein 
10) (14, 15). ACs catalyze conversion of ATP to 
3'-5'-cyclic adenosine monophosphate (cAMP)— 
a second messenger that affects different 
physiological and biochemical processes. ACs 
have been extensively studied in animals, but 
little is known about them in plants (20, 27). 
Our results indicate that AT4G15640 and 
AT3G21465 are not classical PPR proteins but 
are required for translation initiation by plant 
mitoribosomes, and as such we propose a new 
annotation for AT4G15640 and AT3G21465: 
MITOCHONDRIAL TRANSLATION FACTOR 
1 (mTRANI) and mTRAN3Q, respectively. 
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mTRANI and mTRANZ are plant-specific proteins 
Using a BLAST search, we found that mTRAN1 
(AT4G15640) and mTRAN2 (AT3G21465) are 
present only in Embryophyta (land plants). Pro- 
tein sequence alignment showed that mTRAN1 
and -2 are highly similar (84% identical), 
suggesting that they are paralogs in Arabi- 
dopsis thaliana. Some angiosperms, includ- 
ing Brassicaceae and Glycine max, contain 
two mTRAN proteins (Fig. 1A and fig. S1), in- 
dicating that the duplication event of mTRAN 
protein-encoding genes occurred relatively late 
in evolution, with the divergence of Fabaceae 
dating back around 70 million years (22). 
Physcomitrium patens, a nonvascular moss, 
also has two mTRAN proteins, suggesting that 
independent duplication events have occurred 
(23). The remaining land plant species exam- 
ined here contain only one mTRAN protein. 


mTRANI1 and mTRANZ are targeted 
to mitochondria 


Prediction analyses suggested that mTRAN1 is 
mitochondrially targeted (24), and this was sup- 
ported by mitochondrial complexome profiling 
(25). To confirm mTRAN mitochondrial local- 
ization, we stably expressed mTRANI-GFP and 
mTRAN2-GFP fusions in Arabidopsis Col-0 
plants. mTRANI-GFP and mTRAN2-GFP co- 
localized with the MitoTracker fluorescent dye 
that labels mitochondria (Fig. 1B). Addition- 
ally, we performed immunoblot analysis on 
purified cytosolic, mitochondrial, and chloroplas- 
tic fractions isolated from Arabidopsis seedlings 
expressing mTRANI-GFP or mTRAN2-GFP 
(Fig. 1C). mTRANI-GFP or mTRAN2-GFP could 
only be detected in the mitochondrial fractions, 
further confirming their mitochondrial location. 


The mtran mutants have a growth 
reduction phenotype 


mTRANI and mTRAN2 genes show relatively 
similar expression patterns during plant de- 
velopment (fig. $2). Of note, mTRAN proteins 
are expressed very early in seeds during ger- 
mination (one day after imbibition) (fig. S2), 
suggesting that they might be critical for plant 
development. Three independent knockout lines 
were obtained for each mTRAN gene (fig. S3A). 
We obtained double homozygous knockout 
plants for mtrani-1 x mtran2-1 (mtranI-1/2-1) 
and mtranl-2 x mtran2-2 (mtranI-2/2-2) and 
verified loss of mTRANI and mTRAN2 tran- 
scripts by quantitative reverse-transcription 
polymerase chain reaction (qRT-PCR) (fig. S3D). 
Only plants homozygous for either mtranI-3 
or mtran2-3 and hemizygous for the other 
mutations could be obtained, indicating that 
mtranI-3/2-3 was nonviable (fig. $3, B and C). 
The mtrani single mutants were smaller than 
the Col-0 wild type (WT), whereas the mtran2 
mutants appeared more similar in size to the 
WT (Fig. 1D). mtranI-1/2-1 was also smaller 
than the WT, whereas mtranI-2/2-2 showed 
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severe growth retardation. mtran single and 
double mutants were significantly slower to 
germinate (stage 0.5) and develop true leaves 
(stage 1.02/1.04) (Fig. 1E) (26). Primary roots of 
the mtran single and double mutants were 
significantly shorter than those of the WT 
(fig. S3E). Again, the double mutants showed a 
>80% reduction in primary root growth. Ad- 
ditionally, mtran single and double mutants 
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were significantly smaller and slower to flower 
than the WT (fig. S3F). mtranI-2/2-2 had a 
stronger phenotype compared with mtranI-1/ 
2-1 (Fig. 1, D to F). The milder phenotype of 
mtrant-1/2-1 is likely because of the presence 
of the T-DNA in the 10th intron of mTRAN1, 
potentially resulting in a partially active pro- 
tein. For m7TRANI, the T-DNA is inserted in 
the 5th and 7th exon, likely resulting in severe 
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growth deficiency and lethality of mtranI-2/2- 
2 and mtranI-3/2-3, respectively. 


mTRANI1 and mTRANZ are unlikely to be 
adenylyl cyclases 


mTRANI and mTRAN?2 are annotated as ACs, 
but we could not find experimental evidence 
nor a defined source for this annotation. To 
evaluate mTRANI/2 AC activity, we performed 
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a bacterial cAMP synthase complementation 
assay in the BTHIOI (cya) E. coli strain that 
lacks endogenous AC activity (27). The cells 
transformed with the positive control formed 
colored colonies on MacConkey/maltose and 
LB/X-gal media and could also grow on M63 
medium (fig. S4A). mTRANI and mTRAN2 
were cloned into the pUT18 vector and trans- 
formed into E. coli cya. On all tested media, 
pUT18-mTRANI and pUT18-mTRAN2 colo- 
nies had the same phenotype as the negative 
pUT18 control (fig. S4A). Thus, mTRANI and 
mTRAN2 do not appear to have AC activity in 
the bacterial system, though we cannot rule 
out a false negative result as E. coli may not 
provide the correct environment for plant ACs. 
Furthermore, there was no significant difference 
in cAMP concentration between the WT and 
mtran double-knockout mutants (fig. S4B) de- 
spite their clear growth retardation phenotype 
(Fig. 1, D to F), supporting the idea that mTRAN1 
and mTRAN2 are unlikely to be ACs. 


Mitochondrial oxidative phosphorylation 
complexes show reduced activity in mtran 
double mutants 


To find the cause for slow growth of mtran 
double mutants, mitochondria isolated from 
mtranti-1/2-1 and mtranI-2/2-2 were analyzed 
by blue native polyacrylamide gel electropho- 
resis (BN-PAGE). The mtran double mutants 
showed a decrease in abundance of supercom- 
plex I/III and complexes I, III, and V compared 
with the WT (Fig. 2, A and B). Furthermore, 
the activity of complexes I, III, IV, and V was 
reduced in the mtran double mutants, espe- 
cially mtranI-2/2-2 (Fig. 2C). Freshly isolated 
mitochondria from mtranI-1/2-1 and mtranI- 
2/2-2 were used for oxygen consumption mea- 
surements (Fig. 2D). The state III respiration 
rates (succinate+ADP+NADH) of mtran dou- 
ble mutants were significantly lower com- 
pared with the WT. The mtran double mutants, 
however, had a significantly higher alterna- 
tive oxidase (AOX) pathway capacity (KCN+ 
dithiothreitol+pyruvate). Additionally, the 
ratio of AOX capacity to state III was signif- 
icantly higher in the mtran double mutants, 
suggesting that they rely heavily on alterna- 
tive respiration. 


Loss of mTRAN1 and mTRAN2 causes changes in 
abundance of mitochondrial proteins 


In the mtran double mutants, especially 
mtranti-2/2-2, proteins from complexes I, III, 
IV, and V were less abundant (Fig. 2E), in 
agreement with the decrease in abundance 
and activity of these complexes (Fig. 2, A to C). 
Mitochondrially encoded mitoribosomal sub- 
units RIBOSOMAL PROTEIN S4 (RPS4) and 
RIBOSOMAL PROTEIN L16 (RPL16), and nu- 
clear encoded mitochondrial translation fac- 
tor LEUCINE ZIPPER-EF-HAND-CONTAINING 
TRANSMEMBRANE PROTEIN 1 (LETM1) were 


Tran et al., Science 381, eadg0995 (2023) 


more abundant in the mtran double mutants. 
Nuclear-encoded mtSSU subunit RIBOSOMAL 
PROTEIN S10 (RPS10) did not show clear 
differences in abundance in the mutants. 

Proteins encoded by mitochondrial retro- 
grade signaling marker genes, including 
ALTERNATIVE OXIDASE (AOX), UP REGU- 
LATED BY OXIDATIVE STRESS (UPOX), and 
TRANSLOCASE OF THE INNER MEMBRANE 
17 (TIM17) (28, 29), were more abundant in 
both double mutants. This indicates that loss 
of mTRANI and mTRAN2 induced mitochon- 
drial retrograde/unfolded protein response 
(UPR™) signaling (Fig. 2E). Up-regulation of 
AOX protein is consistent with the reliance of 
mtran double mutants on alternative respira- 
tion (Fig. 2C). 

To assess global changes in mitochondrial 
protein abundance, we analyzed isolated mito- 
chondria with tandem mass spectrometry 
(MS/MS) (data S1). We detected mTRANI in 
the WT and mtranI-1/2-1 mitochondria, with 
comparable abundance. However, mTRANI 
was not detected in mtranI-2/2-2. This is con- 
sistent with our hypothesis regarding mTRAN1 
activity and the location of the T-DNA insertion 
in mTRANI. mTRAN2 could not be detected in 
any of the genotypes, suggesting that mTRAN1 
is much more abundant than mTRAN2, which 
is in agreement with a recent study (3). This 
may explain why we found 684 and 24 pro- 
teins to be significantly different in abundance 
in mtrani-2/2-2 and mtrant-I/2-1 (Paaj < 0.2), 
respectively, compared with the WT. Using a 
less-stringent unadjusted P < 0.05, 247 pro- 
teins were found to be differentially abundant 
in the mtranI-1/2-1 mitochondria, represent- 
ing subunits of complexes I, ITI, IV, and V, in 
agreement with immunoblot results (Fig. 2E). 
Of the 684 proteins affected in mtranI-2/2-2, 
mitochondrially encoded oxidative phospho- 
rylation (OXPHOS) proteins, including NADH 
DEHYDROGENASE 1, 7, and 9 (NadI, Nad7, and 
Nadg), APOCYTOCHROME B (Cob), CYTO- 
CHROME OXIDASE 2 (Cox2), and ATP SYN- 
THASE SUBUNIT 1, 4, and 8 (Atp1, Atp4, and 
Atp8), were less abundant. However, sub- 
units of complex II, all of which are nuclear- 
encoded, were not affected in the mutants. 
Mitochondrially encoded RIBOSOMAL PRO- 
TEIN L5 (RpI5) and Rpl16, and nuclear-encoded 
AOX1A, AOXI1D, and LETM1 were more abun- 
dant. This is consistent with the results of our 
immunoblot analysis. Furthermore, heat shock 
proteins, adenosine diphosphate/adenosine tri- 
phosphate (ADP/ATP) carrier proteins, ALTER- 
NATIVE NAD(P)H DEHYDROGENASES NDAI/ 
NDB2, proteases, mitochondrially targeted ribo- 
somal proteins, PPR/MORF proteins, ovule 
abortion proteins, and amino acid-tRNA lig- 
ases were more abundant. In conclusion, loss 
of both mTRAN proteins results in severe per- 
turbation of the mitochondrial proteome and 
reduction of many OXPHOS components. 


1 September 2023 


mTRAN1 and mTRANZ are part of the 
mitoribosome small subunit 

To identify mTRANI/2-interacting proteins, 
plants expressing mTRANI-GFP or mTRAN2- 
GFP were used for coimmunoprecipitation 
(co-IP) followed by MS/MS. A mitochondria- 
targeted GFP line (mito-GFP) (30) was used as 
a negative control. The phenotypes of comple- 
mentation lines mtranI-2/2-2 35S:mTRANI-GFP 
and mtrantI-2/2-2 35S:mTRAN2-GFP were 
similar to the WT (fig. S5), indicating that 
mTRANI-GFP and mTRAN2-GFP proteins are 
fully functional. A summary of proteins co- 
immunoprecipitated with mTRANI-GFP and 
mTRAN2-GFP is presented in data S2A. We 
found that mTRANI and mTRAN2 interacted 
with 27 components of the mtSSU, as pro- 
posed by previous work (/4, 15). By contrast, 
no mitoribosome core subunits were pulled 
down using the mito-GFP control, indicating | 
the specificity of the interactions. This indi- 
cates that mTRANI and mTRAN2 are part of 
the mtSSU. All ribosomal proteins interacting 
with mTRAN proteins belonged to the mtSSU, 
except mitochondrial ribosomal protein L11, 
which is part of the mtLSU. mtLSU L11 was, 
however, only identified in the first replicate. 
Most rPPR proteins found in Waltz et al. (15) 
and Rugen et al. (14) interacted with mTRAN 
proteins, though rPPR3a (AT1G55890) only 
coimmunoprecipitated with mTRANI1. An in- 
dividual mitochondrion contains on average 
around 520 copies of mtSSU proteins (3), 
which is very close to the estimated 534 copies 
found for mTRANI, supporting the idea that 
mTRANS are integral parts of the plant mito- 
ribosome. We also found proteins interacting 
with mTRAN proteins that were only detected 
in our analysis (data S2A, gray color), includ- 
ing three additional mitochondrially targeted 
ribosomal proteins, a PPR protein (AT3G61520) 
and MORF8. mTRANI and mTRAN2 also inter- 
acted with mitochondrially encoded Nad7 and 
Nad9, which are less abundant in the mtran 
double mutants (Fig. 2 and data S1). Overall, 
our co-IP results demonstrate that mTRANI/2 
are part of the mtSSU, suggesting that they 
might be involved in mitochondrial translation. 


Transcriptome analysis of mtran1-2/2-2 
indicates mitochondrial translation defects 


RNA-seq analysis was carried out on the WT 
and mtranI-2/2-2 (Fig. 1, D to F). 2143 nuclear- 
encoded differentially expressed genes (DEGs; 
P,aj<0.05, fold change >2 or <0.5) were found 
in mtranI-2/2-2 versus the WT, of which 1117 
were up-regulated and 1026 were down- 
regulated (data S3). Gene ontology analysis 
showed that stress-responsive genes were up- 
regulated whereas oxygen level-, hypoxia-, and 
hormone (auxin)-responsive genes were down- 
regulated (data S4). Mitochondrial retrograde 
signaling marker genes such as AOX/Ja were 
significantly up-regulated (data $3), indicating 


3 of 12 


RESEARCH | RESEARCH ARTICLE 


Fig. 2. mtran double mutants 
show defects in mitochondrial 
biogenesis, protein content, 
and function. (A) Analysis of 
abundance of mitochondrial 
complexes by Coomassie-Colloidal 
stained-BN-PAGE. Arrows indi- 
cate respiratory complexes and 
supercomplexes. The molecular 
weight (kDa) is shown on the right 
side of the gel. (B) Western blot 
using anti-RISP antibody for native 
complex III on mitochondrial pro- 
tein extracts of wild-type and 
mtran double mutant separated 
by BN-PAGE. (C) Activity mea- 
surement of the respiratory 
complexes |, Ill, IV, and V in 
BN-PAGE. Arrows indicate respi- 
ratory complexes and super- 
complexes. I+lllo, supercomplex 
l+lllo; CIIl, complex Ill; CIV, 
complex IV; CV, complex V; Fi, 
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outer mitochondrial membrane. 


that loss of mTRANI and mTRAN2 activates 
UPR™ signaling likely through ANACO17 (32). 
We confirmed up-regulation of UPR™ marker 
genes in mtranI-2/2-2 and mtranI-1/2-1 by qRT- 
PCR (fig. S6A). 

Next, the RNA-seq dataset of mtranI-2/2-2 
was compared with the transcript profiles of 
22 additional Arabidopsis mutants and chem- 
ical treatments affecting mitochondrial func- 
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mtran1-1/2-1 mimtran1-2/2-2 


tion (Fig. 3A and data S5A). The similarities 
among all datasets were evaluated by com- 
parison of common pairwise DEGs calculated 
through the Sgrensen-Dice similarity coeffi- 
cient (DSC) (data S5B), followed by hierarchi- 
cal clustering of the complete DSC matrix (Fig. 
3A). mtranI-2/2-2 clustered most closely with 
the rpsi0 P2 and P3 mutants, in which transcript 
levels of mtSSU-subunit RPSIO are reduced (32). 
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This further supports that mTRANSs are in- 
volved in mitochondrial translation. mtranI-2/ 
2-2 also clustered relatively closely to anti- 
mycin A (AA) (inhibitor of complex ITI), oligo- 
mycin (inhibitor of complex V), and rotenone 
Gnhibitor of complex I) datasets, in line with 
the observed loss of activity of these complexes 
in mtran double mutants (Fig. 2). We then 
visualized the similarities of nuclear transcript 
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Fig. 3. Loss of mTRAN proteins A 0 
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expressed genes (DEGs) among 
the transcriptomic datasets 
analyzed by the S@rensen-Dice 
similarity coefficient (DSC). 

The complete matrix of DSC 
values for mtranl-2/2-2 and 22 
additional datasets was hierarchi- 
cally clustered using euclidean 
distance. The subbranch including 
mtranl-2/2-2 is highlighted for 
clarity. Color bar indicates linear 
fold change. (B) Autoradiogram 

of in organello protein synthesis for 
10, 30, and 60 min using purified Cc 


47208729 


(C) Polysome profiling for 
mitochondrially encoded tran- 
scripts: OXPHOS subunits, 
mitoribosomal subunit RPS4 

and mitochondrial rRNAs rrn18/ 
rm26, and nuclear-encoded 
transcript UBC21. Percentage 

of total mRNA in all fractions was 
measured by qRT-PCR. 


Percentage of total RNA present in all fractions 
3 


profiles among mtrantI-2/2-2, rps10 mutants, 
and prohibitin atphb3 mutant (29), which did 
not cluster closely to mtranI-2/2-2 (fig. S6B). 
mtrani-2/2-2 shared 326 and 651 DEGs with 
rps10 P2 and P3, respectively, whereas 106 
common DEGs were found between mtranI- 
2/2-2 and atphb3. 288 DEGS were commonly 
found between mtranI-2/2-2 and both rps10 
mutants. 31 DEGs were common among all 
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genotypes, including many UPR™ markers 
such as AOXI7a (fig. S6C). 

We also analyzed mtranI-2/2-2 mitochon- 
drial and chloroplast transcript profiles (data S6). 
A number of mitochondrially encoded genes 
(57) were significantly up-regulated (FC>1.4, 
P,aj<0.05), with fold change up to ~4.39 for 
trnY (data S6A). Mitochondrial transcription 
thus seems globally increased, which may be a 
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compensation response to reduced translation. 
All six genes induced more than four times en- 
coded tRNAs, further suggesting an attempt to 
compensate for reduced translation. The mito- 
chondrial transcriptome was further com- 
pared with mitochondrial/chloroplast RNA 
polymerase rpotmp and atphb3 mutants (29, 33). 
In rpotmp and atphb3 mutants a mix of up- 
and down-regulated genes could be observed, 
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whereas in mtranI-2/2-2 nearly all genes were 
up-regulated (fig. S6D and data S6C). By con- 
trast, all 117 annotated chloroplast-encoded 
genes were slightly down-regulated (P,4)<0.05) 
(data S6B). In conclusion, loss of mTRANI/2 
resulted in increased expression of nearly all 
mitochondrially encoded genes, suggesting 
that the cells attempted to compensate for re- 
duced mitochondrial translation. 

Next, we analyzed the mitochondrial tran- 
scriptome for alterations in splicing and edit- 
ing (data S7). The rate of C-to-U editing was 
significantly different between mtranI-2/2-2 
and the WT for 262 cytosines (FDR < 0.05) 
(data S7A). Most (208) of these editing sites 
were relatively less edited in the mutant than 
in the WT (data S7A). However, considering 
that mitochondrial transcripts were more abun- 
dant in mtranI-2/2-2, for 244 differentially 
edited sites there were still more edited tran- 
scripts in absolute terms in mtranI-2/2-2 com- 
pared with that in the WT. We therefore suggest 
that there are more than enough correctly 
edited mitochondrial transcripts in mtranI-2/ 
2-2 mitochondria to allow production of the 
correct proteins. For six mitochondrial tran- 
script splice sites, a significant difference was 
observed in the mutants versus the WT (FDR 
< 0.05) (data S7B). Also here, for all transcripts 
there was a higher absolute number of spliced 
transcripts in the mutant, indicating that the 
lower abundance of many mitochondrially en- 
coded proteins (Fig. 2) is not caused by a lack 
of correctly spliced/edited mitochondrial tran- 
scripts. The observed similarities in nuclear 
transcriptome between mtranI-2/2-2 and rps10 
mutants, along with excessive expression of 
mitochondrial transcripts, thus point to a de- 
fect in mitochondrial translation. 


mTRANI and mTRANZ are required for 
translation in mitochondria 


To validate whether mitochondrial translation 
was indeed affected in the mtran double mu- 
tants, we performed in organello protein syn- 
thesis assays using freshly isolated mitochondria 
from Arabidopsis seedlings. Synthesized radio- 
labeled mitochondrial translation products were 
evaluated after 10, 30, and 60 min of translation 
(Fig. 3B and fig. $7). Sodium acetate, a sub- 
strate only for bacterial translation, was used to 
evaluate bacterial contamination during mito- 
chondrial purification. We could identify sev- 
eral distinct protein bands, including Atp1, 
Cob, Cox2, and Atp9, based on their molecular 
mass (34, 35). Atp1, Cob, Cox2, and Atp9 had a 
slower rate of translation in the mtran double- 
knockout mutants than in the WT. This was 
consistent with our immunoblot and MS/MS 
analysis (Fig. 2E and data S1), in which we 
observed a reduction in the abundance of Atp1 
and Cox2 in the mtran double-knockout mu- 
tants. Thus, mTRANI and mTRAN2 are required 
for efficient translation in mitochondria. 
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The efficiency of translation depends on 
both translation initiation and elongation. If 
translation initiation is impaired, one would 
expect fewer ribosomes attached to a given 
mRNA. If translation elongation is defective, 
one would expect more ribosomes attached 
to the mRNA, due to difficulties in completing 
translation. To allow us to distinguish be- 
tween these two possibilities in the mtran 
double mutants, we assessed ribosome load- 
ing along mitochondrially encoded transcripts 
in mutants and the WT (Fig. 3C). Polysomes 
are composed of mRNAs bound to two or 
more ribosomes, whereas monosomes consist 
of mRNAs bound to a single ribosome and/or 
“vacant couples,” which are stable associations 
of small and large ribosomal subunits without 
binding mRNAs (36). In this analysis, poly- 
somes were separated from monosomes and 
free mRNAs by sucrose gradient ultracentrifu- 
gation. After centrifugation, we collected 10 
fractions along the gradient (fractions 1 to 10 
from light to heavy) and performed qRT-PCR 
to determine the percentage of mitochondrial 
mRNA present in each fraction (Fig. 3C). Over- 
all, all analyzed mitochondrially encoded 
mRNAs of both mtranI-1/2-1 and mtranI-2/ 
2-2 were distributed mainly in fractions 2 and 
3, whereas mRNAs in the WT were mostly 
found in fractions 3 and 4 (Fig. 3C). These 
shifts toward the lighter fractions indicate that 
the number of ribosomes associated with the 
mRNAs was lower in the mtran double mu- 
tants, suggesting inefficient translation initiation/ 
mitoribosomal binding to mRNAs. To deter- 
mine the distribution of the mitoribosomes, 
we analyzed 18S rRNA (7rn18) of mtSSU and 
26S rRNA (rrn26) of mtLSU. The rrni8 of 
the WT was mainly found in fractions 3 and 4 
whereas 77726 was mainly in fraction 4, colo- 
calizing with protein-encoding mRNAs (Fig. 
3C). This indicates that most WT mRNAs are 
bound by mitoribosomes and are actively trans- 
lated. In mtranI-2/2-2, rrn18 was still present 
mainly in fractions 3 and 4, as observed in the 
WT. rrn26 was, however, spread out over 
lighter fractions 2 to 4, suggesting that mtLSU 
is not efficiently translating in mtranI-2/2-2. 
In mtranI-1/2-1, the mitoribosomes were main- 
ly found in the heavy fractions 8 and 9, and to 
a lesser extent in fractions 3 and 5, as opposed 
to in the lighter fractions 2 to 4 as observed in 
mtrani-2/2-2. 

In all cases, the mRNA levels in fraction 2 
were much higher in the mutants than in the 
WT. Because fraction 2 contains almost no 
mitoribosomes according to our analysis (Fig. 
3C and fig. S8), our results indicate that a large 
proportion of mitochondrial mRNAs are likely 
devoid of ribosomes in the mtran double mu- 
tants. In mtranI-1/2-1, a small percentage of 
several mRNAs was found in fractions 8 and 9, 
indicating that they are heavily translated by 
polysomes (see discussion). However, no obvi- 
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ous effect of loss of mTRANSs could be detected 
on cytosolic translation, as assessed by ribo- 
some loading of nuclear-encoded UBIQUITIN- 
CONJUGATING ENZYME 21 (UBC21). Together, 
our findings provide evidence that mnTRAN 
proteins are key components of plant mito- 
ribosomes required for translation initiation. 


Structural modeling suggests that mTRAN 
proteins have PPR protein-like structures 


Because mTRAN proteins are part of the 
mtSSU (data S2) and required for translation 
(Fig. 3), we assessed predicted protein struc- 
tures to anticipate whether the proteins could 
bind mitochondrial mRNAs to help mtSSU ini- 
tiate translation. Structures of mTRAN proteins 
without the predicted mitochondrial target- 
ing sequences (MTS) (37) were modeled using 
AlphaFold (38), RoseTTAFold (39), and iTASSER 
(40) (fig. S9). The modeled partial mTRAN1 | 
structure extracted from the cryo-EM structure 
of the mitoribosome from Brassica oleracea 
var. botrytis (cauliflower) (16) is also presented 
for comparison (fig. S9). Overall, the protein 
structures shown by prediction tools and 
Waltz et al. (16) suggest that mTRAN1 and 
mTRAN2 have tetratricopeptide-repeat (TPR)/ 
PPR protein-like structures. PPR proteins are 
known as RNA binding proteins involved in 
RNA processing, splicing, stability, editing, and 
translation (19, 41). The three programs pre- 
dicted that mTRAN1 and mTRAN2 contain 8 
to 10 PPR-like repeats. The cryo-EM structure 
of the plant mitoribosome by Waltz et al. (16) 
proposed that mTRANI is formed by 6 PPR- 
like repeats, based on the number of PPR re- 
peats predicted by “TPRpred” software (42). 

We then attempted to find potential mTRAN- 
RNA target sites using aPPRove (43) and the 
PPRCODE prediction server (44) but were un- 
able to find potential RNA binding sites for 
mTRANI and mTRAN2. This suggests that 
mTRANs belong to a PPR-like protein class 
that does not obey the standard PPR code and 
may bind RNA differently. 


mTRANI binds to putative mitoribosome 
binding sites in mitochondrial 5' UTRs 


As potential RNA binding sites could not be 
predicted for mTRANs using the PPR code, we 
searched for potential conserved mitoribosome 
binding sites in the 5’ UTRs of mitochondrial 
mRNAs. Using the Multiple Expectation- 
maximization for Motif Enrichment (MEME) 
motif search tool, we identified a potential 
CUUUxU-like mitoribosome binding site in the 
5’ UTRs of 26 mitochondrial mRNAs (Fig. 4A, 
fig. S10, and data S8, B and C). Waltz et al. (16) 
suggested that the plant mitoribosome might 
recognize an AxAAA-related motif, which is 
located 19 bases upstream of the start codons 
in 17 mitochondrial mRNA 5’ UTRs. The MEME 
motif search tool was unable to identify the 
AxAAA-related motif, but we found a similar 
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Fig. 4. mTRANI binds potential A 


mitoribosome binding sites 
in vitro. (A) The motif analysis 
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AAGAAx-like motif in 30 5’ UTRs when in- 
cluding the intercistronic UTRs (Fig. 4A, fig. 
$10, and data S8C). Furthermore, we manually 
found an overlapping AxAAAG-like motif in 
the 5’ UTRs of 17 mitochondrial mRNAs (Fig. 
4A, fig. S10, and data S8, B and C). In sum- 
mary, we identified at least one CUUUxU or 
AAGAAx/AxAAAG motif in the 5’ regions of all 
30 mitochondrial mRNAs (fig. S10). 
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cox2 


Next, we searched the 5’ upstream genomic 
regions for hexamers enriched at any given 
position in 49706 mitochondrially encoded 
genes of seed plants, 4256 of mosses, 2659 of 
liverworts, and 7867 of green algae, using 
“PLMdetect” (45, 46) (fig. S11). In seed plants, 
there is a preference for enriched motifs around 
position —25 upstream of the start codon, but 
the location of the motifs is more variable, ex- 
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cob/rpI5 


tending beyond 100 bases upstream and into 
the coding sequence. Similar diffuse prefer- 
ences were observed in mosses and liverworts, 
with peaks situated around —25/—28 or -27, re- 
spectively. Analysis of 7867 green algae mito- 
chondrial gene 5’ genomic regions revealed an 
extremely tight peak at —25 and nearly no signal 
elsewhere (fig. S11). The sequences of the motifs 
enriched around —25 in seed plants were A-rich 
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and related to AAGAAA/AGAAAA, similar to 
those we defined in Arabidopsis (fig. S11 and 
data S9). Over half of the tested seed plant 5’ 
regions had such a motif around the —25 po- 
sition. Also in mosses and liverworts, AAAAAA/ 
AAAAAG-like motifs were present in the major- 
ity of 5’ UTRs around —25/—28 or —27, respectively. 
In green algae almost complete conservation 
of strictly A/U containing hexamers was found 
around —25 of the start codon (fig. S11 and data 
89). Being based on genomic data, the pres- 
ence of these motifs in mature mRNAs re- 
quires further validation. The CUUUxU motif 
was found in all lineages in 3 to 4% of the 
genes with a different preferential position: 
—28 in seed plants and -69 in green algae. Over- 
all, A/U-rich motifs appear to be a common 
feature in most 5’ sequences of plant mito- 
chondrial mRNAs, suggesting that they may 
serve as ribosome binding sites. 

As mTRAN1 is the main isoform in Arabidopsis, 
we performed RNA electrophoretic mobility 
shift assays (REMSAs) to test whether mTRAN1 
can bind to the CUUUxU or AAGAAx/AxAAAG 
motifs identified in 5' UTRs of mitochondrial 
mRNAs (Fig. 4 and fig. $12). Synthesized Cy5- 
labeled RNA probes, containing one or more of 
the predicted mitoribosome binding motifs in 
the 5’ UTRs of nad9, nad7, cox2, cox3, and rpl5/ 
cob were obtained (Fig. 4, A and B). Mutated 
probes were also synthesized to verify the spe- 
cificity of mTRAN1 binding. The REMSAs 
showed that mTRANI could bind the WT nad, 
nad7, cox2, rpl5/cob and cox3 probes specifi- 
cally, whereas the interactions were lost when 
the predicted CUUUxU/AAGAAx/AxAAAG mo- 
tifs were specifically mutated (Fig. 4C). Al- 
though a weak shift was observed in the nad7 
mutated probe, it was clear that mTRAN1 
bound more strongly (lower Kg) to the WT 
than mutated probe (Fig. 4C and fig. S12, C and 
D). The binding of mTRANI appeared stronger 
dower Kg) to AAGAAx/AxAAAG motifs than to 
CUUUxU motifs (fig. S12, C and D). 

To establish whether mTRANI binds the 5’ 
UTRs of mitochondrial mRNAs in vivo, we 
performed RNA immunoprecipitation sequenc- 
ing (RIP-seq) on isolated mitochondria from 
the mTRANI-GFP lines, using mito-GFP as a 
control. To discriminate the mTRAN1-specific 
immunoprecipitated RNA fragments from the 
GFP-control background, we used “DiffSegR” 
(47, 48), which delineates boundaries of dif- 
ferential regions without using preexisting 
annotations, allowing precise identification of 
enriched regions. The RIP-seq analysis showed 
that mTRANI1 binds 5’ UTR regions, generally 
preferring the UTRs to CDS (Fig. 5A). Further- 
more, we observed clear evidence that the 
mTRAN1-bound 5’ UTR regions contained the 
same CUUUxU/AAGAAx/AxAAAG motifs for 
nad9, cox2, and cox3 as used for the REMSAs 
(Fig. 4 and 5B). The 7ps4 mRNA has no 5’ UTR, 
but we found an mTRAN1-interacting segment 
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at the 5’ of the mRNA, with a clear peak cov- 
ering an AAAAAA motif positioned 13 bases 
after the ATG (Fig. 5B). Significant mTRANI- 
bound regions were found in the 5’ UTR or 
just downstream of the AUG for 17 protein- 
encoding genes. A MEME motif search of these 
regions confirmed enrichment of COUUUxU 
and AAGAAx motifs (data S10). Together, our 
findings indicate that mTRANI binds to CUUUxU 
and AAGAAx/AxAAAG-like motifs, which are 
potential mitoribosome binding sites in the 5’ 
regions of plant mitochondrial mRNAs, to in- 
itiate translation. 


mTRAN proteins are likely universal plant 
mtSSU translation initiation factors 


To further assess how translation of mitochon- 
drial mRNAs is affected in the mtran mutants, 
we performed ribosome profiling analysis 
(Ribo-seq), in which short RNAse-protected 
mRNA fragments covered by the translating 
ribosomes (called ribosome footprints) were 
isolated and sequenced. To compare the rela- 
tive density of ribosome footprints on each 
mitochondrial mRNA between mtran mu- 
tants and WT, the calculated densities were 
normalized to both mRNA length and abun- 
dance determined by qRT-PCR (data S11). 
As observed in our polysome fractionation 
analysis, all mitochondrial mRNAs in the 
mtran mutants showed much lower ribosome 
loading levels compared with the WT (Fig. 
6A). A single, possibly artefactual, strong foot- 
print observed in ccmFN, masked the reduc- 
tion in footprints across the rest of the ccomFN;, 
mRNA in both mtran double mutants. In gen- 
eral, mtranI-2/2-2 have lower ribosome den- 
sities compared with the mtranJ-1/2-1 mutant, 
especially on mRNAs encoding OXPHOS pro- 
teins (~0.5 times on average). Visualization of 
mitoribosome footprints of representative mito- 
chondrial mRNAs clearly showed that transla- 
tion activity was very low in mtranI-2/2-2 and 
moderately low in mtranI-1/2-1 (Fig. 6B). This 
shows that not only a subset—but virtually 
all—mitochondrial mRNAs depend on mTRAN 
proteins for translation, indicating that mTRAN 
proteins likely are “universal” (for all plant 
mRNAs) mtSSU subunits involved in transla- 
tion initiation. 


Discussion 


Phylogenetic analysis revealed that mTRANs 
are likely land plant-specific proteins existing 
as single-copy genes in most plant species. 
Based on the more-pronounced growth defects 
in mtranI mutants than mtran2 mutants, in- 
consistent detection of mTRAN2 in proteomic 
studies, and higher mTRAN1 protein number 
per mitochondrion (3), mTRANTI is likely the 
major isoform in Arabidopsis. When mTRANI 
is knocked out, however, m7TRAN2 can appar- 
ently take over most functions. When knocking 
out both isoforms, a range of phenotypes was 
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observed, including embryo lethality (mtrant- 
3/2-3) and severe (mtranI-2/2-2) to moderate 
(mtrant-1/2-1) growth defects. However, it is 
not as clear why mtranI-2/2-2 is Gust) viable 
and mtranI-3/2-3 is not. We thus propose that 
mTRANs are essential for normal develop- 
ment of plants. 

Knocking out m7RANs reduced complex I, 
III, IV, and V abundance and activity while 
increasing AOX capacity. Because all complexes 
containing mitochondrially encoded subunits 
were strongly reduced in abundance and ac- 
tivity, mTRANs must have a very fundamental 
function in plant OXPHOS biogenesis. In com- 
parison, mutants in mitochondrial/chloroplast 
RNA polymerase rpotmp are only affected in 
complex I and IV abundance (32). Together, 
impaired OXPHOS-dependent ATP production 
likely causes the growth retardation phenotype 
in mtran mutants. This may be conveyed sys- 
temically by negatively affecting, for example, 
auxin signaling, as observed in the RNA-seq 
data (data S4). An antagonistic relation between 
UPR™ and auxin was suggested previously (49) 
and is underlined by the up-regulation of UPR™ 
markers. Down-regulation of chloroplast- 
encoded transcripts by impairing mitochon- 
drial respiration was also previously reported 
(32, 50), suggesting that an unknown retrograde 
pathway may reduce chloroplast transcription. 

mTRAN1/2 interacted with 17 mtSSU ribo- 
somal proteins. Two proteomic studies also 
identified mTRAN1/2 as mtSSU components 
using independent methods (/4, 75). Our co-IP 
additionally identified five plant-specific mtSSU 
rPPR proteins (74, 15). We discovered proteins 
interacting with mTRANs identified only in 
our analysis, including nad7/nad9, which were 
less abundant in mtran double mutants. We 
therefore suspect that we captured nascent 
Nad7/Nad9 peptides that were actively being 
translated by the mitoribosome. The in organ- 
ello protein synthesis assays, showing reduced 
translation rates in the mtran double mutants, 


further support that mTRANs are key mtSSU_ - 


components required for efficient translation 
in mitochondria. 

Polysome fractionation showed that a much 
larger proportion of mitochondrial mRNAs is 
not bound by mitoribosomes in mtran double 
mutants compared with the WT, suggesting 
inefficient translation initiation. The reduced 
ribosome loading was also observed for nad6 
and rps4 mRNAs, although Nad6 showed no 
changes and rps4 was more abundant at the 
protein level. This suggests that Nad6 protein 
may be quite stable, possibly even if not in- 
corporated into the fully assembled complex I. 
The ribosome fractionation showed that a 
small fraction of mRNAs (generally <10%) has 
high ribosome loading in mtranJ-1/2-1 (Fig. 3C), 
observed in fractions 8 and 9. We propose that 
this small fraction of mRNAs could still be bound 
by the mtranI-1/2-1 mutant mitoribosomes, 
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Fig. 5. mTRAN1 binds A/U-rich A 
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sites in the 5' UTRs of nad9, 
cox2, and cox3 and in the 5’ end 
of the rps4 CDS. Red bar charts 
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perhaps due to different secondary/tertiary 
structure of the 5’ UTR allowing better mito- 
ribosome entry. As a result, these rare “bind- 
able” mRNAs may then be heavily translated 
by all available mitoribosomes, partially com- 
pensating for the lack of initiation on the vast 
majority of “unbindable” mRNAs. This com- 
pensation in the mtranJ-1/2-1 mutant (in which 
truncated mTRANI protein is detected) likely 
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explains the higher vigor of mtranI-1/2-1 com- 
pared with mtranI-2/2-2 plants. 

The question arises as to how mTRAN pro- 
teins play such a fundamental role in mitochon- 
drial translation. The polysome fractionation 
suggests that the problem lies in the very first 
steps, where the mitoribosome detects and 
binds the mRNA. Plant mitochondrial mRNAs 
lack a Shine-Dalgarno (SD) sequence, and plant 
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mitochondrial rRNAs do not have anti-SD se- 
quences (13). Our data suggest that mTRANs 
are recognition factors, which would mean that 
in plant mitochondria a protein-mRNA inter- 
action mediates ribosome binding, rather than 
an rRNA-mRNA interaction. A cryo-EM study 
of the structure of plant mitoribosomes (J6) 
proposed that mTRANI (named as ms83 and 
rPPR10 by the authors) is a PPR protein, which 
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are known RNA binding proteins. Indeed, an- 
other study identified mTRANI as an mRNA- 
binding protein (57). However, mTRAN proteins 
are far from “classical” PPR proteins. First, the 
largest PPR database based on 44,562 PPR 
protein sequences from >1000 transcriptomes 
(52, 53) does not identify mTRANs as PPR 
proteins. Furthermore, “TPRpred” predicts other 
rPPRs as PPR proteins with 100% confidence, 
yet only gives a confidence score of 4 to 14% 
for mTRANI/2. Additionally, the gene struc- 
ture of mTRAN with 10 introns is highly aber- 
rant compared with classical PPR genes (fig. 
S3A), which usually have 0 to 1 introns (54). 
Introns are associated with higher mRNA sta- 
bility, and indeed m7RANs have longer average 
transcript half-lives of 4.4 hours, as compared 
with 1.7 hours for the average PPR gene with 0 
to 1 introns (55), resulting in 1.8 to 2.4 times 
higher relative expression (data S12). Never- 
theless, only one m7TRAN/I splice form seems 
dominant while two are expressed for mTRAN2 
(of which one is low abundant with an early stop 
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codon) according to PastDB (56). However, 
protein structure prediction tools AlphaFold, 
RoseTTAFold, and iTASSER showed that 
mTRAN proteins are likely to have «-solenoid/ 
PPR protein-like structures (57). Unfortunate- 
ly, potential RNA binding sites could not be 
predicted by aPPRove (43) and PPRCODE (44). 
This further suggests that mTRAN proteins are 
a-solenoid proteins that bind RNA differently 
to classical PPR proteins. Based on available 
protein structure predictions, 6 to 10 repeated 
motifs were found in the mTRAN protein struc- 
ture, which suggests that they could interact 
with a mRNA motif of around 6 to 10 bases, 
matching our identified CUUUxU/AAGAAx/ 
AxAAAG motifs. mTRANs were suggested to 
sit in a cleft where the incoming mRNA may 
be positioned in the cryo-EM mitoribosome 
structure (16). It should be noted that the reso- 
lution of several regions in the currently available 
plant mitoribosome structure is, however, too 
low to accurately position mTRANs. Waltz et al. 
(16) hypothesized that mTRANT is part of the 
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region that might recognize an AxAAA-related 
motif of mitochondrial mRNA 5’ UTRs, thus 
acting as a SD-anti-SD-like recognition system. 


« 


¢ 


Additionally, their motif analysis suggested that - 


the 5’ UTRs of only 17 mitochondrial mRNAs 
contained a loosely conserved AxAAA consen- 
sus. However, our study showed that mTRANs 
are required for translation of all mitochon- 
drially encoded genes, of which many do not 
contain such an AxAAA 5’ UTR motif but rather 
a CUUUxU or AAGAAx motif. 

Indeed, mitoribosome loading was lower for 
all mRNAs in mtran mutants. Thus, mMTRAN 
proteins may be “universal” recognition fac- 
tors for all mitochondrial mRNAs, rather than 
mRNA-specific initiation cofactors (9, 10). The 
universally reduced mitoribosome loading in 
the mtran double mutants is also different 
from the ribosome loading in rps70 mutants 
(58), a mtSSU subunit thought to be involved 
in elongation (59). In the rpsi0 mutants, varia- 
ble increased or decreased ribosome loading 
was observed on mRNAs encoding OXPHOS 
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components, but higher ribosome loading 
was found for translation- and cytochrome 
c maturation-related mRNAs. This difference 
further suggests that mTRANS are involved in 
translation initiation and not elongation. 

Most Arabidopsis mitochondrial mRNAs 
have long 5’ UTRs, which is different from 
mammals but relatively similar to yeast (9). 
Our results indicate that the putative mito- 
ribosome binding sites CUUUxU and AAGAAx/ 
AxAAAG in plant mitochondrial mRNAs are 
not at a particular distance from the start 
codon (figs. S10 and S11), though with some 
preference for -25 bases upstream of AUG. 
Therefore, we hypothesize that either the 2/3D 
structure of the 5’ UTRs may bring the ribo- 
some binding sites physically closer to the 
start codon, or that plant mtSSU can effi- 
ciently scan long 5’ UTRs to find the correct 
AUG, perhaps with the help of other mRNA- 
specific cofactors (60). In green algae, an A/U- 
motif is conserved around —25 bases from the 
start codon (fig. S11); yet mTRAN genes are 
absent. So perhaps mTRANSs evolved in land 
plants to compensate for increasing variability 
in mitochondrial ribosome binding site posi- 
tion, or conversely, the appearance of mTRANs 
allowed this variability to occur. These findings 
indicate that translational initiation by mito- 
chondrial ribosomes occurs in a different way in 
plants as compared with in fungi and animals 
(12, 61). 


Materials and methods summary 


Plants were grown under long day condition 
(16h light/8h dark, approximately 120 umol 
photons m~?s~’). A. thaliana Col-0 was used 
as the WT. The T-DNA insertion lines SALK_ 
044.671, GABI_915G12, WiscDsLox485-488E21, 
SALK_054298, SALK_096907, and SALK_099373 
were obtained from the European Arabidopsis 
Stock Centre. The double-knockout mutants 
mtranti-1/2-1 and mtranI-2/2-2 were obtained 
by crossing. Stable transgenic and complemen- 
tation lines were generated by transforming 
PB7FWG2-mTRANI/2 into the WT or mtranI-2/ 
2-2 by floral dipping, respectively. Plant mito- 
chondria were purified as described in Tran et al. 
(62) for further analysis, e.g., BN-PAGE, West- 
ern blot, MS/MS analysis, RIP-seq, and in or- 
ganello translation assays. For whole genome 
RNA-seq analysis, 11-day-old WT and 18-day- 
old mtranI-2/2-2 seedlings grown on MS agar 
plates were harvested at developmental growth 
stage 1.04 (26). Total RNA was isolated to gen- 
erate RNA-seq libraries for sequencing. Bioin- 
formatics and statistical analysis were carried 
out according to Baudry et al. (63). Polysomal 
RNA purified as described in (64) was used for 
qRT-PCR. mTRANI protein was expressed and 
purified from E. coli. 5'Cy5 labeled RNA probes 
were obtained from Sigma-Aldrich. REMSAs 
were performed as described previously (65, 66). 
Mitoribosome footprints were prepared from 
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Arabidopsis flower buds as previously described 
(8). Isolated mitochondria from mito-GFP and 
mTRANI-GFP lines were subjected to immuno- 
precipitations for RIP-seq analysis as previously 
described (67). The RIP-seq data were processed, 
mapped against Col-O mitogenome, and ana- 
lyzed using DiffSegR to identify mTRANI- 
binding sequences. PLMdetect (45) was used 
to analyze preferred hexameric motif locations 
in the -300/+200 genomic upstream regions 
from the start codon of mitochondrial protein- 
coding genes from all species available in the 
NCBI database. A complete “Materials and 
Methods” section is provided in the Supple- 
mentary Information. 
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INTRODUCTION: Tissue plasminogen activator 
(tPA) is a serine protease that initiates fibrinol- 
ysis to remove excessive blood clots and restore 
blood flow. Intravenous infusion of recombinant 
tPA is approved as a thrombolytic therapy in 
thrombotic cardiovascular diseases, including 
ischemic stroke and myocardial infarction. Low 
plasma tPA activity is associated with a higher 
risk of atherosclerotic cardiovascular disease and 
atherogenic apolipoprotein B (apoB)-lipoprotein 
cholesterol levels in humans. However, whether 
low tPA elevates apoB-lipoprotein cholesterol 
is unknown. 


RATIONALE: Given the central role of hepato- 
cytes in apoB-lipoprotein production and recent 
studies showing that hepatocytes synthesize 
tPA, we sought to identify possible links be- 
tween tPA and apoB-lipoprotein assembly and 
secretion from hepatocytes. 


RESULTS: To investigate the role of hepatocyte 
tPA in apoB-lipoprotein metabolism, tPA expres- 


sion was silenced or deleted in the hepatocytes 
of mice. These manipulations resulted in higher 
plasma apoB and cholesterol levels, indepen- 
dent of any changes in hepatic low-density 
lipoprotein receptor (LDLR) or apolipoprotein 
E (apoE) expression or Apob mRNA level. The 
higher plasma cholesterol in these mice was 
distributed in the very-low-density lipopro- 
tein (VLDL) and low-density lipoprotein (LDL) 
fractions. In human primary hepatocytes, 
silencing tPA increased the secretion of new- 
ly synthesized [?H]-labeled apoB despite no 
change in Apob mRNA. Thus, hepatocyte- 
tPA deficiency increases the secretion of 
apoB-lipoproteins. 

Lipidation is a key factor determining the 
fate of intrahepatic apoB. Poorly lipidated apoB 
undergoes intracellular degradation, whereas 
fully lipidated apoB is efficiently secreted as 
VLDL particles with larger size and lower den- 
sity. Silencing hepatocyte tPA in Ldlr” mice led 
to larger VLDL particles in the plasma, with 
more triglyceride per VLDL particle, indicating 


lipidated in the endoplasmic reticulum (EF.-, 
microsomal triglyceride transfer protein (MTP), 
which incorporates neutral lipids onto nascent 
apoB. Transfecting human primary hepato- 
cytes with a plasmid encoding tPA led to lower 
secretion of newly synthesized [?H]-labeled 
apoB. Proximity ligation, confocal imaging, 
and immunoprecipitation assays revealed that 
tPA interacts with apoB in the hepatocyte ER. 
In addition, recombinant tPA interacts with 
solid-phase immobilized LDL, inhibits MTP- 
apoB interaction, and reduces neutral lipid 
transfer to apoB. Moreover, the serine pro- 
tease inactive tPA (S513A) also interacts with 
solid-phase immobilized LDL, reducing both 
apoB secretion by human primary hepatocytes 
and MTP-mediated lipid transfer activity to 
the same degree as wild-type tPA, which in- 
dicates that this action of tPAis independent | 
of its protease activity. The tPA-LDL inter- 
action is inhibited by antibodies against 
the Kringle 2 (K2) domain of tPA, the MTP- 
interacting regions at the N terminus of apoB, 
and the lysine analog tranexamic acid. Fur- 
ther, deleting the K2 domain or mutating the 
lysine-binding site in the K2 domain of tPA ‘ 
abrogates the effects of tPA on limiting apoB 
secretion. These data indicate that tPA, par- 
tially through the lysine-binding site on its 
K2 domain, binds to the N terminus of apoB, ‘ 
blocking the interaction between apoB and 
MTP in hepatocytes. This process reduces 
VLDL assembly and plasma apoB-lipoprotein 
cholesterol levels. 

Plasminogen activator inhibitor 1 (PAI-1) is a 
rapidly acting serine protease inhibitor (serpin) 
of tPA. Upon lipid loading of hepatocytes, PAT-1 
forms a complex with tPA and sequesters tPA 
away from apoB, which allows apoB to be 
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Intracellular tPA-PAI-1 interaction in the ER of hepatocytes determines apoB lipidation, VLDL 
assembly, and secretion. |n the basal state, tPA interacts with apoB and inhibits MTP-apoB interaction in 
the ER of hepatocytes, limiting MTP-mediated apoB lipidation, VLDL assembly, and secretion. When 
hepatocytes are loaded with lipid, PAI-1 sequesters tPA away from apoB by forming a complex with tPA, 
allowing the lipidation of apoB and the assembly and secretion of VLDL. [Figure created with Biorender] 
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particles and lower plasma levels of apoB- 
lipoprotein cholesterol. 


CONCLUSION: The findings in this study sug- 
gest a mechanism that fine-tunes VLDL assem- 
bly through intracellular interactions among 
tPA, PAI-1, and apoB in hepatocytes, thereby 
affecting the plasma levels of atherogenic 
apoB-lipoproteins. Knowledge of this mech- 
anism of hepatic lipoprotein regulation may 
suggest therapeutic strategies for lowering 
atherogenic apoB-lipoproteins and cardiovas- 
cular risk. 
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Apolipoprotein B (apoB)-lipoproteins initiate and promote atherosclerotic cardiovascular disease. 
Plasma tissue plasminogen activator (tPA) activity is negatively associated with atherogenic apoB- 
lipoprotein cholesterol levels in humans, but the mechanisms are unknown. We found that tPA, 
partially through the lysine-binding site on its Kringle 2 domain, binds to the N terminus of apoB, 
blocking the interaction between apoB and microsomal triglyceride transfer protein (MTP) in hepatocytes, 
thereby reducing very-low-density lipoprotein (VLDL) assembly and plasma apoB-lipoprotein cholesterol 
levels. Plasminogen activator inhibitor 1 (PAI-1) sequesters tPA away from apoB and increases VLDL 
assembly. Humans with PAI-1 deficiency have smaller VLDL particles and lower plasma levels of apoB- 
lipoprotein cholesterol. These results suggest a mechanism that fine-tunes VLDL assembly by 
intracellular interactions among tPA, PAI-1, and apoB in hepatocytes. 


polipoprotein B (apoB)-lipoproteins ini- 
tiate and promote atherosclerotic cardio- 
vascular disease (CVD) (/, 2). The major 
source of blood apoB-lipoproteins is he- 
patocytes, where very-low-density lipo- 
proteins (VLDLs) are assembled and secreted. 
Circulating VLDL is then progressively hydro- 
lyzed in the blood to form intermediate-density 
lipoprotein (IDL) and low-density lipopro- 
tein (LDL) (2). Currently recommended lipid- 
intervention therapies to prevent CVD primarily 
use statins or protein convertase subtilisin/ 
kexin type 9 (PCSK9) inhibitors, both of which 
lower LDL by enhancing hepatic LDL re- 
ceptor (LDLR)-mediated LDL clearance (3). 
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However, these treatments have only a modest 
effect on other atherogenic apoB-containing 
lipoproteins, such as VLDL and IDL (4, 5), 
which contribute to the residual CVD risk in 
populations with well-controlled LDL cho- 
lesterol (6, 7). Thus, therapies that inhibit 
hepatic VLDL production might be useful in 
decreasing CVD risk because they would 
lower all atherogenic apoB-lipoproteins. apoB, 
a large complex protein, is the structural scaf- 
fold for forming VLDL (8). VLDL is assembled 
in hepatocytes by incorporating triglyceride, 
cholesteryl ester, and phospholipid onto apoB 
to form spherical particles (9). This process, 
known as apoB lipidation, depends on both 
lipid availability and a neutral lipid transporter, 
microsomal triglyceride transfer protein (MTP) 
(0, 11). MTP binds to apoB in the hepatocyte 
endoplasmic reticulum (ER) and transfers lip- 
ids to apoB (10). When MTP and/or lipids are 
unavailable, VLDL cannot be assembled, and 
newly translated apoB is then targeted for deg- 
radation (72). Although the essential role of MTP 
in apoB lipidation has been well established, 
much less is known about the regulation of 
apoB-MTP interaction and MTP-mediated lipid 
transfer to apoB. 

Previous studies have shown that low plas- 
ma tissue plasminogen activator (tPA) activity 
is associated with a higher risk of atheroscle- 
rotic CVD (73-15), but whether decreased fi- 
brinolysis contributes to CVD in this setting 
remains unknown. Another plausible mech- 
anism linking low tPA to CVD is elevated plas- 
ma apoB-lipoprotein cholesterol, which is seen 
in humans with decreased tPA activity (16-18). 
However, whether apoB-lipoprotein metabolism 
and tPA are causally linked and, if so, the mech- 
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anisms involved remain unknown. Given the 
central role of hepatocytes in apoB-lipoprotein 
production and our recent studies in mice 
showing that hepatocytes synthesize tPA and 
contribute ~40 to 50% to both the plasma basal 
tPA concentration and to fibrinolysis (19, 20), 
we sought to identify possible links between 
hepatocyte tPA and apoB-lipoprotein assem- 
bly and secretion. Our investigation revealed 
that endogenous hepatocyte tPA limits VLDL 
assembly by directly interacting with apoB, 
interrupting the interaction between apoB and 
MTP and thereby impairing MTP-dependent 
neutral lipid transfer and apoB lipidation. We 
also show that plasminogen activator inhibitor 
1 (PAI-1) binds to tPA within hepatocytes and 
abolishes the effect of tPA on limiting VLDL as- 
sembly. Our in vivo data suggest that the tPA- 
PAI-1 interaction enables physiologic postprandial 
VLDL assembly. Thus, our findings that tPA | 
can alter the production of atherogenic apoB- 
lipoproteins have potential therapeutic impli- 
cations for lowering atherosclerotic CVD risk. 


Lowering hepatocyte tPA raises plasma apoB 
To investigate the role of hepatocyte tPA in de- 


termining plasma apoB-lipoprotein levels ina ‘ 


setting where hepatic clearance by the LDL 
receptor is not a factor, tPA expression was 
silenced in the hepatocytes of Ldlr mice fed 
a high-fat, high-cholesterol Western diet (WD). 
This silencing was accomplished by admin- 
istering an adenovirus-associated virus 8 (AAV8) 
expressing a hairpin RNA against Plat mRNA, 
which encodes tPA protein, driven by the H1 
promoter, AAV8-H1-shPlat (sh-tPA) (79, 20). 
The hepatocyte tPA-silenced mice showed a 
47% higher plasma total cholesterol level (P < 
0.01) and 28% higher plasma apoB-100 (P < 
0.05) compared with mice receiving AAV8-H1- 
scrambled RNA (scr) (Fig. 1A and fig. SIA). 
Plasma lipoprotein fraction profiling by fast 
protein liquid chromatography (FPLC) re- 
vealed that the hepatocyte tPA-silenced mice 
had more cholesterol and apoB-100 in the 
VLDL and LDL fractions and more triglyceride 
in VLDL (Fig. 1A). Using a different hypercho- 
lesterolemic model, we showed that silenc- 
ing hepatocyte tPA in WD-fed Apoe knockout 
(Apoe™! ~) mice led to 56% higher plasma total 
cholesterol (P < 0.05) and 28% higher apoB- 
100 (P < 0.05) accompanied by increases in 
VLDL and LDL cholesterol, triglyceride, and 
apoB-100 (Fig. 1B and fig. S1B). Similar results 
were observed in WD-fed wild-type (WT) mice 
(fig. S1C and fig. S2). To further validate the 
role of hepatocyte tPA in determining plasma 
apoB-lipoprotein levels, the hepatocyte tPA- 
knockout mouse model was generated by 
administering Plat!” mice with an AAV8 
expressing a Cre recombinase driven by the 
thyroxine-binding globulin (TBG) promoter, 
AAV8-TBG-cre (Cre). The WD-fed hepatocyte 
tPA-knockout mice showed a 30% higher plasma 
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Fig. 1. Silencing hepatocyte tPA increases apoB-lipoprotein cholesterol and 
apoB independently of LDLR or apoE. (A) Ldir'’~ mice were treated with 
AAV8-H1-shPlat (sh-tPA) or AAV8-H1-scrambled control (scr) and then fed the 
WD for 8 weeks. The livers were assayed for tPA protein, and plasma samples were 
assayed for total cholesterol and apoB-100 concentrations and for FPLC profiles 

of cholesterol, triglyceride, and apoB-100 (n = 9 to 10 mice per group). (B) Apoe” 
mice were treated with AAV8-H1-shPlat (sh-tPA) or AAV8-H1-scrambled control 
(scr) and then fed the WD for 8 weeks. The livers were assayed for tPA 
protein, and plasma samples were assayed for total cholesterol and apoB-100 
concentrations and for FPLC profiles of cholesterol, triglyceride, and apoB-100 

(n = 5 mice per group). (C) Plat” mice were treated with AAV8-TBG-cre (Cre) 
or AAV8-TBG-GFP (GFP) and then fed the WD for 8 weeks. The livers were assayed 


for tPA protein, and plasma samples were assayed for total cholesterol and 
apoB-100 concentrations and for FPLC profiles of cholesterol, triglyceride, and 
apoB-100. The cholesterol in the VLDL fractions is shown in a zoomed-in smaller 
graph (n = 6 mice per group). (D) Human primary hepatocytes were treated 
with siRNA against tPA mRNA (si-tPA) or scrambled RNA for 24 hours. Cell 
culture medium apoB was quantified by immunoblot. VLDL fractions were isolated 
by ultracentrifugation, and cholesterol and triglyceride concentrations in VLDL 
fractions were assayed. (E) McA-RH7777 cells were treated with siRNA against tPA 
mRNA (si-tPA) or scrambled RNA for 24 hours. VLDL was isolated from the 
medium by ultracentrifugation, and cholesterol and triglyceride concentrations in 
VLDL were assayed. Cell culture medium apoB was assayed by immunoblot. Data 
are shown as means + SEMs; *P < 0.05 by two-tailed Student's t test. 


total cholesterol level (P < 0.01) and 33% higher 
plasma apoB-100 level (P < 0.05), accompanied 
by higher cholesterol and apoB-100 in the VLDL 
and LDL fractions and higher triglyceride in 


TBG-GFP [green fluorescent protein (GFP)] (Fig. 
1C and fig. S1D). 

Silencing tPA did not change Apob mRNA 
in the livers of Ldlr/ >i Apoe’ ~, or WT mice 


VLDL, compared with mice receiving AAV8- 
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(fig. S3, Ato C), which suggests that increased 
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plasma apoB was not because of increased 
apoB synthesis. Moreover, the plasma apoB 
data were not a result of changes in apoE- or 
LDLR-mediated pathways because plasma 
apoE levels and liver LDLR expression remained 
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unchanged (fig. S4, A and B). Collectively, these 
data show that silencing hepatocyte tPA raises 
plasma apoB lipoprotein-cholesterol through 
a mechanism independent of the LDL recep- 
tor or apoE. 

To examine hepatocyte-intrinsic effects and 
to link this finding to human relevance, tPA 
expression was silenced in primary human 
hepatocytes using small interfering RNA 
(siRNA) against PLAT mRNA (si-tPA). The si- 
tPA treated cells had higher levels of apoB-100 
in their media compared with control (scr) 
hepatocytes (fig. SSA and Fig. 1D), whereas 
hepatocyte APOB mRNA levels were similar in 
the two groups of cells (fig. S5B). Further, the 
contents of cholesterol and triglyceride were 
higher in the VLDL fractions isolated from the 


Fig. 2. tPA limits apoB lipidation 

in the ER. (A) WT mice were A 
treated with AAV8-H1-shPlat 

(sh-tPA) or AAV8-H1-scrambled 

control (scr) and then fed the WD 

for 14 weeks. Mice were injected 
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culture medium of tPA-silenced human hepa- 
tocytes (Fig. 1D). Similar findings were ob- 
served in cultured McA-RH7777 cells (Fig. 1E 
and fig. S5, C and D), a rat hepatoma cell line. 
The McA-RH7777 cell is an established model 
system to study VLDL production because it 
synthesizes VLDL-sized apoB particles similar 
to human hepatocytes (27). These data show that 
the ability of tPA to regulate apoB-lipoprotein 
secretion is a cell-intrinsic property of hepato- 
cytes and is applicable to humans. 


tPA limits apoB lipidation in the ER 

The data in Ll” and Apoe! ~ mice suggest that 
tPA limits the production—not the clearance— 
of apoB-lipoproteins. To examine this point 
further, we investigated the effect of hepato- 


cyte tPA silencing on VLDL secretion in mice 
injected with the nonionic surfactant deter- 
gent poloxamer 407 (P4.07). The P407 injection 
blocks VLDL clearance by inhibiting lipo- 
protein lipase activity and VLDL lipolysis (22). 
Hepatocyte tPA-silenced mice had a faster 
rise in triglycerides (Fig. 2A), which suggests 
increased VLDL production. Moreover, plas- 
ma apoB-100, which is derived only from 
hepatocytes, increased to a greater degree than 
plasma apoB-48, which is produced by both 
hepatocytes and intestinal epithelial cells in 
mice (23) (Fig. 2B). These data, when combined 
with those presented above, suggest that si- 
lencing hepatocyte tPA increases plasma apoB- 
containing lipoprotein levels by promoting 
VLDL production. 
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treated with AAV8-H1-shPlat 


(sh-tPA) or AAV8-H1-scrambled control (scr) and then fed the WD for 8 weeks 

(n = 9 to 10 mice per group). VLDL was isolated by ultracentrifugation, and VLDL 
diameter was measured by dynamic light scattering. (E) Ldir’“~ mice were 
treated with AAV8-H1-shPlat (sh-tPA) or AAV8-H1-scrambled control (scr) and 
then fed the WD for 8 weeks. VLDL was isolated by ultracentrifugation and 
assayed for the ratio of triglyceride to apoB-100 (n = 9 to 10 mice per group). 

(F) Whole-body tPA knockout mice (holo-tPA-KO) were treated with AAV8-TBG-Plat 
(tPA) or AAV8-TBG-lacZ (LacZ) and then fed a normal chow diet for 8 weeks. 
Plasma samples were assayed for VLDL cholesterol, LDL cholesterol, and apoB-100 
concentrations (n = 6 mice per group). (G@) Human primary hepatocytes were 
transduced with a plasmid encoding tPA with C-terminal HA tag (tPA-HA) or GFP. 
After 48 hours, apoB secretion was measured using a [°H]-labeling method as 
follows: hepatocytes were incubated for 20 min in [°H]-leucine-containing medium 
and chased for 3 hours in [°H]-leucine-free medium, and then radioactivity 
associated with apoB in the cell medium was quantified by scintillation counting. 
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(H) Human primary hepatocytes were treated with siRNA against tPA mRNA 
(si-tPA) or scrambled RNA for 24 hours. apoB secretion was measured using 
[°H}-labeling, as in (G). Radioactivity associated with apoB in cell medium was 
quantified by scintillation counting. (I) Human primary hepatocytes were treated 
with siRNA against tPA mRNA (si-tPA) or scrambled RNA for 24 hours. VLDL was 
isolated by ultracentrifugation, and VLDL diameter was measured by dynamic light 
scattering. (J) Human primary hepatocytes were treated with siR 


A against tPA 


si-tPA) or scrambled RNA for 24 hours. VLDL was isolated by ultracentrifugation 
and assayed for the ratio of triglyceride to apoB-100. (K) Human primary hepatocytes 
were treated with siRNA against tPA mRNA (si-tPA) or scrambled RNA for 24 hours. 
The ER fraction was isolated, and proteins from the ER were extracted. apoB- 
lipoproteins extracted from the ER were further separated by density gradient 
ultracentrifugation and divided into six fractions of increasing density from fraction 1 
to 6. apoB from each fraction was measured by immunoblot. Data are shown as 
means + SEMs; *P < 0.05 by two-tailed Student's t test. 
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Lipidation is a key factor determining the 
fate of intrahepatic apoB. Poorly lipidated apoB 
undergoes intracellular degradation, whereas 
fully lipidated apoB is efficiently secreted as 
VLDL particles with larger size and lower 
density (24). Electron microscopic scanning 
of plasma VLDL particles isolated by density 
ultracentrifugation showed a shifted distrib- 
ution of the particles to larger diameters in the 
hepatocyte tPA-silenced Ldir~ mice (Fig. 2C). 
Moreover, dynamic light-scattering analyses 
(25-27) revealed that VLDL from sh-tPA mice 
had a larger hydrodynamic diameter than VLDL 
from control mice (Fig. 2D). These combined data 
suggest that the VLDL of the tPA-silenced mice 
have a higher lipid content, which is further sup- 
ported by the finding that the VLDL of the tPA- 
silenced mice had a higher triglyceride/apoB 
ratio (Fig. 2E). 

To explore the impact of increasing hepatocyte 
tPA expression on plasma apoB-lipoproteins 
in vivo, whole-body tPA knockout mice (holo- 
tPA-KO) were injected with AAV8-TBG-Plat 
(tPA), to restore tPA expression specifically in 
hepatocytes, or with AAV8-TBG-LacZ control 
(LacZ) (19, 20, 28-30). As predicted, plasma 
apoB-100 and VLDL- and LDL-cholesterol were 
lower in the mice treated AAV8-TBG-Plat versus 
AAV8-TBG-LacZ (Fig. 2F). 

To show relevance to lipoprotein production 
by human hepatocytes, apoB secretion was 
measured using a °[H]-leucine pulse-chase 
method (37) in human primary hepatocytes. 
Transfecting these cells with a plasmid en- 
coding tPA led to lower apoB-associated radio- 
activity in the cell medium, indicating lower 
secretion of newly synthesized apoB (Fig. 2G). 
By contrast, silencing tPA increased the se- 
cretion of newly synthesized apoB (Fig. 2H). 
Similar results were observed in tPA-silenced 
McA-RH7777 cells (fig. S6). However, incubat- 
ing human primary hepatocytes with recom- 
binant human tPA did not change culture 
medium apoB-100 levels and cholesterol and 
triglyceride contents in VLDL (fig. $7), which 
suggests that the hepatocyte tPA limits apoB 
lipidation and production intracellularly. 

Consistent with the mouse plasma data 
above, silencing tPA increased the diameter 
and triglyceride/apoB ratio of VLDL isolated 
from the cell medium (Fig. 2, I and J). These 
data suggest that silencing tPA increases apoB 
lipidation. ER is the major site of apoB lip- 
idation and VLDL assembly (32). The degree 
of apoB lipidation can be ascertained by the 
density of apoB lipoprotein particles—namely, 
greater lipidation of apoB results in lower 
density of the particle (33, 34). In this con- 
text, tPA-silenced human primary hepatocytes 
had overall higher ER-associated apoB and a 
higher proportion of apoB in lower-density 
fractions (fractions 1 and 2, Fig. 2K) than 
control hepatocytes. Because density and lipid- 
ation are inversely related, these findings fur- 
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ther support the hypothesis that tPA limits 
apoB lipidation in the ER. 


tPA blocks MTP-mediated VLDL assembly by 
interacting with apoB 


MTP is a chaperone that promotes intra- 
hepatic apoB lipidation by transferring and 
incorporating neutral lipids, notably triglyc- 
eride and cholesteryl esters, to apoB to as- 
semble VLDL in the ER (/0, 77). MTP-mediated 
lipid transfer to apoB involves its direct bind- 
ing to apoB (10, 35) because inhibiting the 
apoB-MTP interaction decreases apoB lipida- 
tion and secretion (17). We found that tPA 
silencing in human primary hepatocytes did 
not alter the level of MTP protein (Fig. 3A, 
input). However, MTP immunoprecipitated 
from tPA-silenced cells showed a higher con- 
tent of apoB compared with immunoprecipi- 
tates of MTP from control hepatocytes (Fig. 3A). 
These data suggest that silencing tPA increases 
apoB-MTP interaction, which would be ex- 
pected to increase lipid transfer to apoB (11). 
Consistent with this idea, microsomal frac- 
tions isolated from tPA-silenced hepatocytes 
had a twofold higher neutral lipid transfer 
activity (36) compared with microsomes from 
control cells (Fig. 3B, groups 1 and 2). This 
lipid transfer activity was due to MTP, as the 
MTP inhibitor CP-346086 (37) completely abol- 
ished the elevated neutral lipid transfer activity 
in microsomes from tPA-silenced hepatocytes 
(Fig. 3B, group 3). Similar findings were ob- 
served in McA-RH7777 cells (fig. S8). Conversely, 
when tPA was expressed in human primary 
hepatocytes, apoB-MTP interaction and neutral 
lipid transfer activity were decreased (Fig. 3, C 
and D). Moreover, purified recombinant tPA 
protein reduced MTP-mediated neutral lipid 
transfer to LDL in a dose-dependent manner 
(Fig. 3E). Thus, tPA inhibits apoB-MTP inter- 
action and reduces MTP-mediated lipid trans- 
fer activity. 

To examine tPA interaction with apoB, we 
analyzed primary human hepatocytes trans- 
fected with tPA with a C-terminal HA-tag and 
found that apoB eluted from the anti-HA pre- 
cipitates (Fig. 3F). This finding was further 
supported by the results of a tPA-apoB proximity- 
ligation assay in hepatocytes using labeled anti- 
tPA and anti-apoB. The data showed intracellular 
punctate fluorescence (Fig. 3G), which indi- 
cates that endogenous tPA and apoB interact 
in these cells. We also found, using confocal 
immunofluorescence microscopy, that tPA and 
apoB colocalize in the ER (Fig. 3H). Moreover, 
expression of a form of tPA that is retained in 
the ER because of a KDEL sequence (38, 39) at 
the C terminus of tPA (tPA-KDEL) reduced 
secretion of newly synthesized apoB (Fig. 31). 
Collectively, these data suggest that tPA inter- 
acts with apoB in the ER, which, as described 
below, is relevant to the lowering of apoB- 
VLDL assembly and secretion by tPA. 
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The protease activity of tPA depends on the 
serine residue at position 513, and substituting 
serine to alanine (S513A) completely abolishes 
the serine protease activity of tPA (40). We 
found that the serine protease mutant tPA 
(S513A) reduced both apoB secretion by hu- 
man primary hepatocytes and MTP-mediated 
lipid transfer activity to a similar degree as WT 
tPA (Fig. 3, I and J), indicating that this action 
of tPA is independent of its protease activity. 
We also showed that purified recombinant WT 
and S513A-tPA directly interacted in a dose- 
dependent manner with either LDL or apoB- 
100 immobilized to a microtiter plate surface 
(Fig. 4A and figs. S9 and S10). By contrast, 
purified tPA did not interact with solid-phase 
immobilized MTP (Fig. 4B). Most importantly, 
preincubating LDL with tPA inhibited the 
binding of LDL to MTP (Fig. 4C). Finally, sur- 
face plasmon resonance (SPR) studies provided 
further evidence of noncovalent interaction be- 
tween tPA and LDL (Fig. 4D). These combined 
data suggest that tPA binds apoB in a tPA- 
protease-independent manner and that this 
process blocks the interaction of apoB with 
MTP, thereby decreasing apoB lipidation and 
VLDL assembly and secretion. 


The Kringle 2 (K2) domain of tPA interacts 
with the N terminus of apoB 


The K2 domain of tPA has a lysine binding 
site, which is required for its interaction with 
fibrin (41). Within the K2 domain of human 
tPA, negatively charged aspartic acid residues 
at 236 and 238 are responsible for the binding 
of tPA to positively charged lysine residues 
(42). The surface-exposed N terminus of apoB 
(43) has a positively charged lysine-rich region 
that is responsible for its binding to MTP (44). 
The MTP-interacting region (amino acids 430 
to 570) (45) of the apoB N terminus can be 
partially blocked using a monoclonal antibody 
1D1, developed by immunizing mice with a hu- 
man apoB fragment comprising amino acids 


474 to 539 (46). In this context, we show that - 


the 1D1, but not an epitope control antibody 
raised against amino acids 4031 to 4080 (03 
domain) of human apoB, inhibited the interac- 
tion between tPA and solid surface-immobilized 
LDL (Fig. 4E). These data suggest that tPA 
interacts with the N terminus of apoB. 
Expressing tPA mutants lacking either the 
K2 domain (tPA-A-K2) or in which aspartic acid 
residues at positions 236 and 238 were sub- 
stituted with asparagine (tPA-D236, 238N) 
blocked the ability of tPA to lower apoB sec- 
retion as measured by pulse-chase assay (Fig. 
4F). Similarly, in the solid-phase protein bind- 
ing assay, an antibody against the K2 domain 
of tPA inhibited the binding of tPA to solid 
surface-immobilized LDL (Fig. 4G). Further- 
more, tranexamic acid (TXA), a lysine analog 
that inhibits tPA binding to fibrin, partially re- 
duced the interaction between tPA and solid 
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Fig. 3. tPA blocks apoB-VLDL 
assembly. (A) Human primary A 
hepatocytes were treated with 
siRNA against tPA mRNA (si-tPA) 
or scrambled RNA for 24 hours. 
Cell lysates (input) and anti-MTP 
immunoprecipitates (IP: MTP) 
were assayed for apoB and 
by immunoblot. (B) Human 
primary hepatocytes were treated 
with siRNA against tPA mRNA 
(si-tPA) or scrambled RNA (group 1) 
or tPA mRNA (si-tPA) (groups 2 
and 3) for 24 hours. The microsomal 
fraction was isolated and assayed 
for neutral lipid transfer activity 
with DMSO (groups 1 and 2) or 
without with CP-346086 (10 nM), 
an MTP inhibitor (group 3). 

(C) Human primary hepatocytes 
were transduced with a plasmid 
encoding tPA with C terminal 

HA tag (tPA-HA) or GFP for tPA, ug/ml - 
48 hours. Cell lysates (input) and H 

anti-MTP precipitates (IP: MTP) 
were assayed for apoB and MTP by 
immunoblot. (D) Human primary 
hepatocytes were transduced with a 
plasmid encoding tPA-HA or GFP 

or 48 hours. The microsoma 
raction was isolated and assayed 
or neutral lipid transfer activity. 
(E) The effect of recombinan 
human tPA on lipid transfer from 
donor vesicles to human LDL was 
assayed. (F) Human primary 
hepatocytes were transduced with a 
plasmid encoding tPA-HA or GFP 
or 48 hours. Cell lysates (input) and anti-HA immuno 
were assayed for apoB and tPA. (G) A proximity ligati 
measure apoB-tPA interaction in human primary hepa 


IP: MTP 


TP 
Input 


— 
o 


Relative neutral lipid 
transfer activity 
(ex/em, 465/535 nm) 

oN FD WD 


ApoB 


ApoB + Calnexin 


surface-immobilized LDL (Fig. 4H and fig. 
S11). Taken together, these data support the 
idea that the K2 domain of tPA, which contains 
lysine-binding sites at aspartate residues 236 
and 238, binds to the lysine-rich region of the 
N terminus of apoB (Fig. 41). 


PAI-1 sequesters tPA from apoB in hepatocytes 


Increased circulating intestinal apoB-48- 
lipoproteins with oral feeding enhances he- 
patic production of VLDL in humans (47), yet 
the regulatory mechanisms are not clearly 
understood. Obesity, often associated with 
dyslipidemia, increases tPA synthesis in he- 
patocytes (19). However, the increased tPA is 
overcompensated by a larger increase in the 
serpin inhibitor of tPA, PAI-1, resulting in de- 
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(H) Confocal microscopy immunofluorescence imaging was used to measure the 
subcellular localization of tPA and apoB in human primary hepatocytes. Scale 
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and plasma (19). PAI-1 covalently binds to tPA, 
forming a stable complex (48). Therefore, we 
hypothesized that postprandial lipid loading of 
hepatocytes would increase the intracellular 
interaction of tPA and PAI-1, leading to reduced 
free tPA. Reduced free tPA would then promote 
apoB-MTP interaction, leading to increased 
VLDL assembly and secretion. 

We first immunoblotted tPA in a PAI-1- 
immunoprecipitate from primary human 
hepatocytes and found a band at ~120 Kg, rep- 
resenting a covalently bound, sodium dodecyl 
sulfate (SDS)-stable complex of tPA (~’70 kD) 
and PAI-1 (~50 kD) (Fig. 5A). This interaction 
was further validated by a proximity-ligation 
assay using antibodies against tPA and PAI-1 
(Fig. 5B). Oleate is a known stimulus for apoB 


creased net functional free tPA in the livers 
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lipidation and VLDL production (49, 50). We 
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enzymatically inactive mutant of tPA (tPA-S513A), tPA with an ER retention 
signal sequence (tPA-KDEL), or GFP for 48 hours. apoB secretion was measured 
by [°H} labeling, as in Fig. 2. (J) The effect of recombinant WT tPA (tPA-WT) or « 
enzymatically inactive mutant of tPA (tPA-S513A) on lipid transfer from donor 
vesicles to human LDL was assayed. Data are shown as means + SEMs; *P < 
0.05 by two-tailed Student's t test (D) or by one-way ANOVA followed by 
Dunnett's test [(B), (E), (I), and (J)]. 


observed increased tPA-PAI-1 complex and re- 
duced free tPA as early as 1 hour after oleate 
treatment of human primary hepatocytes, which 
is the time needed to observe a stimulatory 
effect of oleate on VLDL production (50), the 
tPA-PAI-1 complex increased, and free tPA 
was reduced (Fig. 5, C and D). Therefore, the 
tPA-PAI-1 complex forms rapidly in the hepa- 
tocytes after oleate treatment. With prolonged 
oleate treatment for 6 and 24 hours, free tPA 
was further decreased, accompanied by in- 
creased tPA-PAI-1 complex formation (Fig. 5D). 
These results suggest a type of regulation of 
apoB lipidation and VLDL production, involv- 
ing PAI-1 interacting with tPA, when hepato- 
cytes are loaded with lipids. 

Silencing PAI-1 in human primary hepato- 
cytes led to higher free tPA and lower apoB 
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Fig. 4. The K2 domain of tPA 


interacts with the N terminus A — tPA-WT+LDL B 

of apoB. (A) A solid-phase binding _ = Lage _ 

assay was used to measure the £4.54 — tPAS513A E45 
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to measure the interaction 
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the binding of MTP to immobilized 
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with the interaction between human recombinant tPA and immobilized purified LDL. (H) A solid-phase binding assay was used to measure whether tranexamic 
acid (TXA) interferes with the interaction between recombinant human tPA and immobilized purified LDL. (I) A schematic diagram depicting the interaction between 
the N terminus of apoB and the K2 domain of tPA. The diagram was generated using biorender.com. Data are shown as means + SEMs; *P < 0.05 by one-way 
ANOVA followed by Dunnett's test. n.s., not significant (P = 0.05). 


secretion, as measured by pulse-chase analysis 
(Fig. 5, E and F). The inhibitory impact of si- 
lencing PAI-1 on apoB secretion was more 
prominent under oleate treatment conditions, 
with an ~400% increase in free tPA and an 
~60% decrease in apoB secretion. By contrast, 
without oleate treatment, there was only an 
~67% increase in free tPA and an ~25% de- 
crease in apoB secretion. Similar results were 
observed in the McA-RH7777 hepatocytes (fig. 
$12). These observations are consistent with 
the hypothesis that, under basal conditions, 
most tPA is free and not bound by PAI-1, and 
thus PAI-1 silencing leads to only a moderate 
increase in free tPA and a modest reduction 
in apoB secretion. However, with oleate load- 


Dai et al., Science 381, eadh5207 (2023) 


ing, more tPA is PAI-1-bound, and silencing 
PAI-1 leads to a robust increase in free tPA 
and a reduction in apoB secretion. Further- 
more, compared with silencing tPA alone, si- 
lencing both tPA and PAI-1 at the same time 
did not lower apoB secretion (Fig. 5G). These 
data support the idea that PAI-1 facilitates 
apoB lipidation by sequestering tPA from apoB. 

The formation of a complex between tPA 
and PAI-1 changes tPA’s conformation (57, 52), 
causing tPA to lose its ability to bind fibrin, 
which is mediated by the lysine binding site in 
its K2 domain (53). We therefore reasoned that 
the binding of PAI-1 to tPA blocks the lysine 
binding site in K2 and thereby blocks tPA-LDL 
interaction. Consistent with this idea, the PAI-1- 
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tPA complex was unable to bind to LDL and did 
not inhibit MTP-mediated neutral lipid trans- 
fer activity (Fig. 5, H and I). 

To test this finding in vivo, we delivered 
olive oil to WT mice by oral gavage. Two and 
6 hours after oral gavage, liver free tPA was 
decreased compared with baseline (Fig. 5J) 
without alteration of liver total tPA and PAI-1 
levels (fig. S13). These data suggest that the 
physiological function of hepatocyte tPA-PAI-1 
interaction in fine-tuning apoB lipidation after 
lipid loading to hepatocytes. As expected, he- 
patocyte PAI-1 knockout mice (19, 54) had 
higher liver free tPA and lower plasma apoB, 
total cholesterol, and cholesterol in VLDL and 
LDL fractions compared with their littermate 
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Fig. 5. PAI-1 sequesters tPA away from apoB, leading to increased VLDL assembly in hepatocytes. 
(A) Human primary hepatocytes lysates (input) and anti-PAI-1 immunoprecipitates (IP: PAI-1) were 
assayed for tPA and PAI-1 by immunoblot. (B) A proximity ligation assay was used to measure tPA-PAI-1 


interaction in human primary hepatocytes. Scale bars, 


10 um. (€) Human primary hepatocytes were treated 


for 6 hours with 0.4 mM oleate complexed with fatty acid-free BSA (oleate group) or fatty acid-free BSA 
alone (vehicle control). Cell lysates were assayed for tPA by immunoblot using the Jess Simple Western 

system. (D) Human primary hepatocytes were treated with 0.4 mM oleate complexed with fatty acid-free 
BSA (oleate group) or fatty acid-free BSA alone (vehicle control). Cell lysates were assayed for tPA—PA-1 


complex and PAl-1-free tPA concentrations by ELISA. 
with siRNA against PAI-1 mRNA (si-PAI1) or scrambled 
0.4 mM oleate complexed with fatty acid-free BSA (0 


(E) Human primary hepatocytes were treated 
RNA and then incubated in medium containing either 
leate group) or fatty acid-free BSA alone (vehicle 


control). Cell lysates were assayed for PAl-1-free tPA concentration by ELISA. (F) Human primary hepatocytes 


were treated with siRNA against PAI-l mRNA (si-PAI1) or scrambled RNA and then incubated in medium 


containing either 0.4 mM oleate complexed with fatty 
alone (vehicle control). apoB secretion was measured 
hepatocytes were treated with siRNA against tPA mR 


scrambled RNA. apoB secretion was measured by [SH]- 
was used to measure the interaction between immobilized LDL and 
complex. (I) The effect of recombinant human tPA and 


acid-free BSA (oleate group) or fatty acid-free BSA 
by [°H]-labeling, as in Fig. 2. (G) Human primary 

A (si-tPA) or against PAI-1 mRNA (si-PAI1) or 
labeling, as in Fig. 2. (H) A solid-phase binding assay 
recombinant human tPA or tPA-PAI-1 
tPA-PAI-1 complex on lipid transfer from donor 


vesicles to human LDL was assayed. (J) Normal chow diet-fed WT mice had their food withdrawn for 5 hours 
and were then euthanized at 0, 1, 2, and 6 hours after oral gavage with olive oil. Liver lysates were assayed 
for PAl-1-free tPA concentration by ELISA. Data are shown as means + SEMs; *P < 0.05 by one-way ANOVA 


followed by Dunnett's test [(D) to (G), (I), and (J)]. ns., 


controls (Fig. 6, A to D). In contrast to the 
higher VLDL production observed in hepatocyte 
tPA-silenced mice (Fig. 2A), hepatocyte PAI-1 
knockout mice had a slower rise in triglycer- 
ides after detergent P407 injection, suggesting 
lower VLDL production (Fig. 6E). Moreover, 
2 hours after an oral gavage of olive oil, plasma 
apoB-100 was increased by 46% in control 
mice but by only 13% in hepatocyte-PAI-1- 
KO mice (Fig. 6F). These combined data sug- 
gest that endogenous hepatocyte PAI-1, by in- 
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not significant (P = 0.05). 


teracting with tPA and sequestering tPA away 
from apoB, facilitates the physiologic increase 
in hepatic VLDL production that occurs in re- 
sponse to fat ingestion. 


PAI-1-deficient humans have lower 
apoB-cholesterol 


We collected plasma samples from humans 
with a homozygous loss-of-function mutation 
in SERPINEI, the gene encoding for PAI-1, and 
compared them with plasma from age-, gender-, 
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and BMI-matched control individuals from 
the same community (55). None of the indi- 
viduals were taking lipid-lowering agents or 
had a known history of CVDs. Plasma from 
the PAI-1-deficient subjects had 22% lower 
LDL-cholesterol (P < 0.05), 17% lower apoB-100 
(P < 0.05), and 18% lower VLDL-cholesterol (P = 
0.06) (Fig. 6G). Compared with control indi- 
viduals, the PAI-1-deficient individuals had 
higher tPA in VLDL (Fig. 6H), and VLDL- 
associated tPA levels were inversely associated 
with VLDL diameter (7 = —0.59, P < 0.01) (Fig. 
61). These data, together with those from hepa- 
tocyte PAI-1-deficient mice and PAI-1-silenced 
human primary hepatocytes (Fig. 5F and Fig. 
6D), are consistent with the hypothesis that 
the PAI-1 deficiency leads to higher free tPA in 
hepatocytes, which enables more tPA to inter- 
act with apoB and thereby limits apoB lip- 
idation and VLDL production. 


Taken together, our findings show that tPA ° 


directly interacts with apoB within the ER of 
hepatocytes and that this interaction reduces 
MTP-mediated VLDL assembly. Lipid load- 
ing to hepatocytes induces tPA-PAI-1 complex 
formation, sequestering tPA from apoB, and 
thereby facilitating apoB lipidation and VLDL 
assembly (Fig. 6J). 


Discussion 


Decades of research have established the role 
for circulating tPA in initiating the lysis of 
blood clots (56, 57). Intravenous infusion of 
recombinant tPA is approved as a thrombolytic 
therapy to restore blood flow in atherothrom- 
botic diseases, including ischemic stroke and 
myocardial infraction (58, 59). Conversely, plas- 
ma levels of the tPA inhibitor PAI-1 are positively 
associated with atherothrombotic disease (60). 
High plasma levels of apoB-lipoproteins pro- 
mote the arterial retention of these lipopro- 
teins, forming atherosclerotic lesions (61, 62). 
The atherosclerotic lesions can then progress 
to a state in which plaque rupture or erosion 


occurs, leading to acute thrombotic vascular - 


events, such as myocardial infarction and stroke 
(63). This study establishes a link between tPA, 
PAI-1, and plasma levels of atherogenic apoB 


lipoproteins, beyond the well-known roles of — 


tPA and PAI-1 in the occlusive thrombus that 
is the ultimate manifestation of atherosclerotic 
vascular disease. The mechanism involves tPA- 
PAI-1 interaction-mediated regulation of VLDL 
assembly in the liver through protein-protein 
interactions rather than through regulation of 
thrombolysis per se. These findings provide a 
possible mechanistic explanation behind the 
correlation between high apoB cholesterol 
and low tPA in CVD patients (13-15). More- 
over, the work has the potential for suggesting 
LDLR-independent therapeutic strategies for 
lowering cardiovascular risk. 

tPA is a multidomain protein consisting of 
finger, growth factor, Kringle 1 (K1), K2, and 
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Fig. 6. PAI-1 deficiency leads to 
lower concentrations of plasma 
apoB and apoB-cholesterol in 
mice and humans. (A) Serpinel™" 
mice were treated with AAV8-TBG-Cre 
(Cre) or control AAV8-TBG-LacZ 
(Ctrl) and then fed a high-fat diet 
for 8 weeks. Plasma total cholesterol 
concentration was measured (n = 6 
per group). (B) Serpinel”" mice 
were treated with AAV8-TBG-Cre 
(Cre) or control AAV8-TBG-LacZ 
(Ctrl) and then fed a high-fat diet 
for 8 weeks. Plasma apoB was 
measured by immunoblot (n = 6 
per group). (C) Serpinel”” mice 
were treated with AAV8-TBG-Cre 
(Cre) or control AAV8-TBG-LacZ 
(Ctrl) and then fed a high-fat diet 
for 8 weeks. Plasma samples were 
subjected to FPLC fractionation 
and assayed for cholesterol 
concentration. The cholesterol in 
VLDL fractions is shown in the 
smaller graph in the inset (n = 6 per 
group). (D) Serpinel™" mice were 
treated with AAV8-TBG-Cre (Cre) or 
control AAV8-TBG-LacZ (Ctrl) and 
then fed a high-fat diet for 8 weeks. 
Liver lysates were assayed fo 
PAl-1-free tPA concentration by ELISA 
(n = 6 per group). (E) Serpinel”" 
mice were treated with AAV8-TBG- 
Cre (Cre) or control AAV8-TBG-GFP 
(Ctrl) and then fed a normal chow 
diet for 4 weeks. Mice were injected 
with P407 i.p. to assess VLDL 
secretion (n = 10 per group). 

(F) Serpinel™" mice were treated 
with AAV8-TBG-Cre (Cre) or control 
AAV8-TBG-GFP (Ctrl) and then 

fed a normal chow diet for 

6 weeks. The mice had their food 
withdrawn for 5 hours and then 
were euthanized at 0, 2, and 
4 hours after oral gavage with olive 
oil. 
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serine protease domains (64). The extracellu- 
lar function of tPA in fibrinolysis is linked to 
its serine protease activity, which mediates 
the enzymatic conversion of plasminogen to 
plasmin, which carries out fibrinolysis (64). 
Our data suggest that the K2 domain, inde- 
pendent of its proteolytic activity, confers tPA’s 
regulatory function in the assembly of VLDL 
by direct binding to the apoB N terminus, there- 
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by inhibiting MTP-apoB interaction. The inhib- 
ition of MTP binding to apoB by tPA is likely to 
be competitive because both tPA and MTP 
bind to the lysine-rich domains of the apoB 
N terminus. However, whether apoB interacts 
with other lysine-binding proteins, such as plas- 
minogen, in the hepatocyte ER, and whether 
apoB and tPA interact in the circulation re- 
present topics for future studies. If these inter- 
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(J) A schematic diagram depicting how tPA-PAI-1 interaction in hepatocytes 
determines VLDL assembly. Without lipid stimulation, tPA interacts with apoB and 
inhibits MTP-apoB interaction in the ER, thereby limiting MTP-mediated apoB 
lipidation and VLDL assembly. When hepatocytes are loaded with lipid, PAI-1 
sequesters free tPA away from apoB and increases VLDL assembly. The diagram 
was generated using biorender.com. Data are shown as means + SEMs; P values 
were calculated by two-tailed Student's t test [(A), (D), (E), and (F)], paired 
Student's t test [(G) and (H)], or Pearson's correlation analysis (I). *P < 0.05. 


actions were found to occur, additional studies 
would then focus on the possible impact of 
these interactions on fibrinolysis and the non- 
fibrinolytic functions of these lysine-binding 
proteins (65-67). 

PAI-1 is a rapidly acting serine protease in- 
hibitor (serpin) of tPA (68). Upon binding to tPA, 
PAI-1 inactivates the serine protease activity of 
tPA and inhibits tPA’s other protease-independent 
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functions, such as tPA’s ability to bind fibrin 
(63) and tPA’s receptor-mediated activities (65). 
Previous studies have mostly focused on the 
tPA-PAI-1 interaction in the blood (69) and on 
the vascular endothelial surface (70), which 
are essential to maintain a balanced fibrino- 
lytic potential. Our findings show that tPA and 
PAI-1 also interact within hepatocytes, fine- 
tuning apoB-VLDL assembly. The intracellular 
interaction between PAI-1 and tPA has also been 
observed in other cell types, such as colonic 
epithelial cells (77). Gerard et al. observed that 
PAI-1-deficient mice have an increase in free 
tPA in the colon (77). These data are consistent 
with our findings that lowering hepatocyte 
PAI-1 expression leads to an increase in free tPA 
levels in both human primary hepatocytes and 
mouse livers. The impact of the intracellular 
interaction of tPA and PAI-1 in other cell types 
is worthy of investigating in future studies. 

Our findings provide possible mechanistic 
explanations for the associations of PLAT and 
SERPINE] polymorphisms with circulating lipid 
levels and atherosclerosis (72-75). Congenital 
tPA deficiency in humans has not been re- 
ported, which suggests that null mutations 
in the tPA gene may be lethal in utero. An 
insertion/deletion (I/D) polymorphism, 1s4646972, 
is located in the eighth intron of the tPA gene 
(76). Homozygotes for the deletion (DD) showed 
lower plasma tPA levels compared with those 
carrying an insertion locus (DI and II) (77). 
Moreover, homozygotes for the deletion (DD) 
had a threefold higher risk of intracranial artery 
atherosclerosis (72) and higher plasma total 
cholesterol and LDL cholesterol (73) compared 
with the homozygotes for the insertion (II). 
Subjects with the DD genotype have a lower 
forearm vascular release of tPA compared with 
the individuals with the II genotype (78). Two 
SERPINE! single-nucleotide polymorphisms 
(SNPs) have also been shown to be related to 
plasma lipid levels. The rs6950982 is associ- 
ated with elevated plasma PAI-1 levels (79) and 
higher total cholesterol, LDL cholesterol, and 
triglyceride (74). Similarly, the rs2227674 is asso- 
ciated with higher plasma PAI-1 levels (79) and 
increased triglyceride (75). Genome-wide associ- 
ation studies showed that genetic variants har- 
boring SERPINE1 loci, leading to elevated PAI-1 
levels, are associated with a higher risk of 
coronary artery disease (80, 87). Heterozygous 
PAI-1 deficiency is associated with a lower com- 
posite coronary artery disease risk score (55). 
Although further investigation is needed, these 
studies raise the possibility that targeting 
hepatocyte PAI-1 may have promise in treat- 
ing atherosclerotic CVDs. 

Moreover, because our data suggest that he- 
patic PAI-1-free tPA limits VLDL assembly and 
reduces blood apoB cholesterol, future studies 
should consider the genetic interaction of tPA 
and PAI-1 SNPs as well as SNPs that affect tPA 
and PAI-1 expression in hepatocytes. Two SNPs, 
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1s9399599 and rs7301826, are associated with 
liver expression of STXBP5 and STX2, both 
of which are exocytosis mediators (82). These 
SNPs are also associated with circulating tPA 
levels in humans (82, 83). Future population 
genetic studies are required to investigate the 
correlation of these SNPs—their associated 
expression quantitative trait loci (eQTLs) in 
hepatocytes—with atherogenic lipoprotein and 
lipid levels. 

We found that the formation of the tPA- 
PAI-1 complex in the hepatocyte is increased 
in response to fatty acid loading. Postprandial 
lipemia results in an influx of fatty acids into 
hepatocytes and stimulates the lipidation of 
apoB and subsequent VLDL secretion, which 
serves as a physiological response to an in- 
creased lipid burden within the hepatocyte 
(50). Fatty acid ingestion increases the degree 
and duration of the association between MTP 
and apoB in hepatocytes (35), which could 
contribute to fatty acid-induced VLDL se- 
cretion. However, the mechanisms by which 
fatty acids promote the association of apoB 
and MTP have remained unclear. Our in vivo 
and in vitro studies indicate an acute increase 
in the formation of the tPA-PAI-1 complexes 
within the hepatocyte after fatty acid stimu- 
lation without an alteration of either total tPA 
or PAI-1 protein levels in the cells. Moreover, 
hepatocyte PAI-1 deficiency leads to increased 
free tPA and reduced fatty acid-induced VLDL 
secretion. These data offer a mechanistic ex- 
planation behind the observed increase in 
VLDL secretion after postprandial lipid load- 
ing of hepatocytes. Additionally, apoB and 
MTP have similar functions in both hepato- 
cytes and intestinal epithelial cells, where they 
are involved in chylomicron assembly (84), but 
whether tPA in the intestine affects chylomi- 
cron assembly requires further study. 

Obesity increases the risk of atherothrom- 
botic events, such as myocardial infarction and 
stroke. Elevated PAI-1 in obesity could be a 
mechanistic link between obesity and increased 
CVD risk by both reducing fibrinolytic poten- 
tial and promoting dyslipidemia. Both plasma 
and liver PAI-1 levels are increased in obesity 
(19, 85), and hepatocytes are an important re- 
source of plasma PAI-1 (19, 86). Approximately 
60 to 70% of obese people have dyslipidemia 
with increased VLDL production (87). This 
study provides possible links among obesity, 
increased VLDL production, and CVD through 
hepatic PAI-1. 

Previous studies have shown that inhibiting 
VLDL production by using MTP inhibitors in- 
creases the risk of steatosis in humans, limit- 
ing the wide use of the agents. However, there 
are no reports to date that PAI-1 deficiency in 
humans is associated with liver steatosis. Addi- 
tionally, global PAT-1 inhibition or PAI-1 deficiency 
in mice attenuates hepatic steatosis (88-90). 
Notably, our study used a mouse model with 
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hepatocyte-specific PAI-1 deficiency (Fig. 6, A 
to F), whereas the studies mentioned above 
used mice with global PAI-1 deficiency (88-90). 
This distinction could be significant because 
PAI-1 is expressed in other cell types, including 
adipocytes (91). Moreover, mice with global PAT-1 
deficiency exhibit resistance to diet-induced 
obesity and display decreased adipocyte size 
compared with WT control mice (90, 92). Thus, 
the attenuated hepatic steatosis seen in global 
PAI-1 deficiency mice might be the result of 
overall metabolic changes, including body weight 
loss, which is not seen in mice with hepatocyte 
PAI-1 deficiency. Therefore, targeting tPA-PAI-1 
balance in the hepatocytes may be an ideal 
therapeutic strategy for dyslipidemia and ath- 
erosclerotic CVDs, but this possibility requires 
more extensive human-based studies. 

This study reveals that the tPA-PAI-1 inter- 
action determines VLDL assembly in hepato- | 
cytes. These findings provide mechanistic 
insight into the physiologic response to lipid 
ingestion and to the atherosclerosis-relevant 
process of excessive apoB-lipoprotein produc- 
tion by the liver. The latter insight may suggest 
ideas to therapeutically lower atherogenic 
lipoproteins to prevent the development of 
atherothrombotic vascular disease. 


Materials and methods summary 


Please refer to the supplementary materials for 
the complete materials and methods. 


Plasma from humans with PAI-1 deficiency 


Plasma samples were collected from members 
of the Berne Amish community, who harbor a 
frameshift mutation in SERPINEI (SERPINET / . 
n = 10) (55) and their age-, gender-, and BMI- 
matched control individuals from the same 
community (7 = 10). The institutional review 
boards (IRBs) at the Indiana Center for Hem- 
ophilia and Thrombosis and Medical College 
of Wisconsin (MCW) approved the study pro- 
tocols. The study participants provided written 
informed consent. 

Mice 

Plat™ mice were generated using WT C57BL6/J 
mice through the homologous recombina- 
tion in embryonic stem cell-based approach 
(Biocytogen, Wakefield, MA). Briefly, a tar- 
geting construct is designed to insert loxP sites 
into the introns 3 and 6, to flox the exons 4 to 
6 of Plat gene. This design is to conditionally 
knock out the exons 4 to 6 of Plat by Cre-loxP 
system. Hepatocyte tPA knockout mice were 
generated by administering Plat“ mice with 
an AAVS8 expressing a Cre recombinase driven 
by the thyroxine-binding globulin (TBG) pro- 
moter, AAV8-TBG-cre (Cre), and Plat™ mice 
receiving AAV8-TBG-GFP (GFP) were used as 
controls. To silence tPA expression in hepto- 
cytes, mice were intravenously injected with 
AAVS8 virus containing shPlat (AAV8-H1-shPlat) 
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(19). Age-matched control mice were injected 
with AAV8-H1-scramble silencing control. Ldlr’ 3 
Apoe’ ~, and WT C57BL/6J mice, used for si- 
lencing hepatocyte tPA, were purchased from 
Jackson Laboratory (JAX) (cat. nos. 002207, 
002052, and 000664, respectively). Ldlr’” and 
Apoe’ ~ mice were fed with WD (Teklad, cat. 
no. TD 88137). WT mice were on a standard 
chow (Lab diet, cat. no. 5053), DIO diet (Re- 
search Diets, cat. no. 12492), or WD (Teklad, cat. 
no. TD 88137). To express tPA in hepatocytes, 
WT C57BL/6J mice and holo-tPA-KO mice 
(Jax, cat. no. 002508), on a standard chow diet, 
received intravenous injection of AAV8- TBG- 
Plat. For all experiments, mice were main- 
tained on a 12-hour light-12-hour dark cycle 
with free access to normal chow, WD, or DIO 
diet and water. Mice of the same age and 
similar weight were randomly assigned to ex- 
perimental and control groups. Plasma lipids 
were assayed in blood collected after a 5-hour 
withdrawal of food. We use power calculations 
to determine the number of mice for each ex- 
periment. We include both male and female 
mice. Mice of the same age and weight are ran- 
domly assigned to groups; exclusion criteria are 
death, injury requiring euthanasia, or weight 
loss >10%, assuming these are rare events that 
are not statistically different between groups. 
All endpoint assays and analyses were con- 
ducted by researchers who were blinded to the 
identity of the cohort. All mouse experiments 
were performed with the approval of the In- 
stitutional Animal Care and Use Committee 
(IACUC) of Biomedical Resource Center at 
MCW and the Institutional Animal Care and 
Use Committee of Columbia University Irving 
Medical Center. 


Vector constructs 


AAV8-TBG-cre and AAV8-TBG-GFP were pur- 
chased from Addgene. As previously described 
(19, 20), AAV8-H1-short hairpin RNA (shRNA) 
construct targeting murine Plat was made by 
annealing complementary oligonucleotides 
and then ligating them into the pAAV-RSV- 
GFPH1 vector. AAV8-TBG-Plat was purchased 
from Vector Biolabs. Plasmid expressing WT 
human tPA, pCMV3-tPA-HA, was purchased 
from Sino Biologic (Beijing, China). The plas- 
mid constructs to express human tPA mutants 
were generated by Versiti BRI Core based on 
the pCMV3-tPA-HA. Constructed tPA mutants 
include tPA-S513A, tPA-KDEL, tPA-A-K2-HA, 
and tPA-D236, 238N. tPA-A-K2 refers to the 
tPA mutant with the replacement of K2 with 
K1, leading to the presence of two copies of K1 
but no K2. As previously described (93), the pur- 
pose of this design is to create a mutant tPA that 
lacks K2 but mimics normal tPA structure. 


Human primary hepatocyte experiments 


Human primary hepatocytes were obtained 
from the Liver Tissue Cell Distribution System 
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at the University of Pittsburgh (Pittsburgh, 
Pennsylvania, USA). All cells were cultured in 
Williams’ Medium E supplemented with He- 
patocyte Maintenance Supplement Pack (Thermo 
Fisher Scientific, cat. no. CM4000). Experiments 
were conducted as described in the figure leg- 
ends. Cells were harvested, and culture media 
were collected, snap-frozen in liquid nitrogen, 
and stored at -80°C until processing. The age 
and gender information of human donors are 
listed in table S3. 


Pulse-chase assay for apoB secretion 


Hepatocyte apoB-100 secretion was assayed 
using the *[H] labeling method as previously 
described (37). Human primary hepatocytes or 
McA-RH7777 cells were washed with leucine-free 
media and pulsed with ?[H] leucine (80 uCi/ml; 
160 Ci/mmol, Perkin Elmer, cat. no. NETTI66005MC) 
for 20 min. The 7[H] leucine-containing medium 
was removed, and cells were incubated with 
fresh Dulbecco’s minimum essential medium 
(DMEM) for an additional 0.5, 1, or 3 hours. apoB 
was immunoprecipitated from cell homogenates 
and media using anti-apoB antibodies (Sigma- 
Aldrich, cat. no. AB742). Nonlabeled apoB-100 
standards were added to the precipitates, and the 
samples were separated by SDS-polyacrylamide 
gel electrophoresis (SDS-PAGE) gel. Gels were 
silver stained, the bands corresponding to apoB- 
100 were excised, and radioactivity associated 
with apoB-100 was quantified by a scintillation 
counter. 


Neutral lipid transfer activity assay 


Neutral lipid transfer activity in cultured he- 
patocyte microsomal fractions was measured 
using a commercially available kit (Sigma- 
Aldrich, MAK110) as previously (36). Specif- 
ically, hepatocytes were homogenized in low 
hypotonic buffer (10 mM Tris-HCl, 1 mM EGTA, 
and 1 mM Mg(Cl2; pH 7.4) using a Polytron ho- 
mogenizer. Microsomes were isolated by ultra- 
centrifugation (SW55 Ti rotor, 50,000 rpm, 
1 hour). Neutral lipid transfer activity was as- 
sayed using the kit according to the manu- 
facturer’s manual. Isolated microsomes were 
incubated with donor vesicles containing fluo- 
rescent lipids and the acceptor vesicles-LDL. 
The fluorescence signal was self-quenched 
when labeled lipids existed within the donor 
vesicles but would be detected after lipids trans- 
ferred to acceptor vesicles. 


Solid-phase protein binding assay 


Solid-phase binding was performed in poly- 
styrene microtiter plates using an enzyme- 
linked immunosorbent assay (ELISA). Microtiter 
plate wells were coated with LDL at 5 ug/ml in 
coating buffer [tris-buffered saline (TBS)] over- 
night at 4°C. Unbound sites were blocked with 
3% nonfat milk in TBS for 1 hour at 37°C. After 
washing with TBS containing 0.05% Tween 20 
(TBS-Tween), tPA was added to the wells at 
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concentrations from 0 to 20 ug/ml in TBS-Tween. 
After a 1-hour incubation at 37°C, the wells were 
washed with TBS-Tween. Bound proteins were 
reacted with anti-tPA [1 ug/ml immunoglobulin 
G (IgG) in TBS-Tween in the presence of 3% 
nonfat milk] followed by goat anti-rabbit IgG 
conjugated to horseradish peroxidase. Trime- 
thylboron (TMB) substrates were added. After 
stopping the reaction, the absorbance at 450 nm 
was measured. 


SPR 


Studies of the binding of recombinant tPA to 
purified LDL were performed with a Biacore 
$200 SPR instrument (Biacore) using a CM5 
sensor chip (Cytiva, cat. no. 29149603) (94). LDL 
was attached to the chip using amine coupling 
chemistry, according to the manufacturer’s in- 
structions. In brief, the chip surface was pre- 
pared by exposing the carboxylated dextran | 
matrix to an aqueous solution containing 0.4 M 
1-ethyl-3-(3-dimethylaminopropyl) carbodiimide 
and 0.1 M N-hydroxysuccinimide (10 ,1/min for 
7 min). Then LDL (500 pg/ml in 10 mM sodium 
acetate buffer, pH 5.5) flowed across the chip 
surface at the same rate for 7 min, followed by 
1M ethanolamine-HCl (pH 8.5) at 19 ul/min for 
7 min to deactivate excessive reactive groups and 
remove any noncovalently bound LDL. This 
procedure led to ~6000 response units (RUs) 
of LDL immobilized. To monitor LDL associ- 
ation with tPA, tPA solutions in HBS-E buffer 
(Biacore) (0.01 M Hepes, 0.15 M NaCl, 3 mM 
EDTA, pH 7.4) in the concentration range of 
10 to 500 pg/ml were flowed across the chip 
at 20 ul/min for 6 min at room temperature. 
The dissociation of tPA was then monitored 
by washing the surface for 6 min with HES 
buffer alone. tPA flowed across an activated, 
but uncoated CM5 chip under the same condi- 
tions as the control for nonspecific binding. 


VLDL particle diameters analysis 


VLDL particles (the particle density is < 1.006 g/ml) 
were isolated by KBr density ultracentrifuga- 
tion (95) followed by two methods to measure 
their diameters. First, isolated VLDL particles 
were negatively stained with 20 g/L phospho- 
tungstic acid (pH 7.0) for 2 min and then viewed 
under a Philips CM10 electron microscope. 
The mean diameters of VLDL particles were 
determined using Image-Pro Plus 5.0 image 
analysis software. Second, the hydrodynamic 
diameters of isolated VLDL particles were 
measured using a Zetasizer uV dynamic laser 
light scattering instrument (Malvern Instru- 
ments) at 633 nm. VLDL samples were trans- 
ferred to a quartz cuvette, and light scatter 
readings were performed at 20°C (25-27). 


Statistics analysis 


The data described in the study were gener- 
ated from biological replicates. The number 
for human participants research and mouse 
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experiments are described in the figure leg- 
ends, where applicable. The in vitro cell ex- 
periments were repeated at least three times. 
All results are presented as means + SEMs. 
P values were calculated using two-tailed Stu- 
dent’s ¢ test for data that passed the normality 
test or the Mann-Whitney rank-sum U test for 
data that were not normally distributed. One- 
way analysis of variance (ANOVA) with post hoc 
Tukey’s test was used to evaluate differences 
among groups when three or more groups were 
analyzed. 


Study approval 


All mouse experiments were conducted with 
the approval of the IACUC and Institutional 
Biosafety Committee of MCW and the IACUC 
of Columbia University Irving Medical Center. 
The use of human cells and plasma samples 
in this study was approved by the IRB at the 
MCW and Indiana Hemophilia and Throm- 
bosis Center. All participants provided written 
informed consent. 
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these (9). The formation process of a MAD: sath = 
not been observed in either type of BH (super- 
massive or stellar-mass). Accretion flows onto 
stellar-mass BHs can evolve much faster than 
those around supermassive BHs. For example, 
outbursts in BHXRBs last for months to years, 
which potentially allows multiwavelength obser- 
vations to probe the formation of a MAD in 
those systems. 

Outbursts of BHXRBs have been explained 
with a disk-instability model (0). In this model, 
mass accumulates during quiescence in a cold 
and geometrically thin accretion disk (hereafter 
referred to as the thin disk). The increasing mass 
triggers a thermal-viscous instability when, at 
some point in the thin disk, the temperature 
becomes high enough to ionize hydrogen. This 
causes the system to enter an x-ray outburst as 
the thin disk empties material into the BH. . 

During an outburst, BHXRBs are observed 
in two very different spectral states—known as 
the hard state and the soft state—in which the 
x-ray emission is dominated by either hard 
(>10 keV) or soft (<10 keV) photons. The timing 
and radio-emission properties also differ be- 
tween the two states (JI, 12). The period between 
the two states is known as the state transition. 


Observations of a black hole x-ray binary indicate 
formation of a magnetically arrested disk 


Bei You", Xinwu Cao2*, Zhen Yan“, Jean-Marie Hameury*, Bozena Czerny°, Yue Wu 
Tianyu Xia"*°, Marek Sikora’°, Shuang-Nan Zhang”? Pu Du”, Piotr T. Zycki?° 


1,6,7 
, 


Accretion of material onto a black hole drags any magnetic fields present inwards, increasing their 
strength. Theory predicts that sufficiently strong magnetic fields can halt the accretion flow, producing 
a magnetically arrested disk (MAD). We analyzed archival multiwavelength observations of an outburst 
from the black hole x-ray binary MAXI J1820+070 in 2018. The radio and optical fluxes were delayed 
compared with the x-ray flux by about 8 and 17 days, respectively. We interpret this as evidence for 
the formation of a MAD. In this scenario, the magnetic field is amplified by an expanding corona, forming 
a MAD around the time of the radio peak. We propose that the optical delay is due to thermal viscous 
instability in the outer disk. 


n abinary system consisting of a black hole 

(BH) and a normal star, gas can be stripped 

off the star and drawn by the strong grav- 

itational field toward the BH (known as 

an accretion flow), where it forms an accre- 
tion disk. In the disk, matter moves inwards 
under the effect of viscosity. The system emits 
large amounts of radiation in the x-ray band, 
which allows it to be observed as a BH x-ray 
binary (BHXRB). Relativistic jets can be launched 
near the BH because of magnetic fields carried 
by the accretion flow in the vicinity of the BH 
(, 2). Numerical simulations have shown 
that any large-scale magnetic field present in 
the accreting material can be dragged inward 
through the accretion flow, which increases its 
strength. If the radial magnetic force becomes 
sufficiently high that it equals the gravita- 
tional force of the BH, theory predicts that 


the accretion flow will halt and form a mag- 
netically arrested disk (MAD) (3, 4). 
Sufficiently strong magnetic fields producing 
a MAD have been observationally inferred in 
accretion systems with supermassive BHs (5-8). 
MADs are also predicted to form in BHXRBs, 
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Fig. 1. Insight-HXMT light curves of MAXI J1820+070 during the 2018 outburst. Orange, black, and 
blue data represent the light curves from HE (27 to 150 keV), ME (10 to 30 keV), and LE (1 to 10 keV) 
instruments, respectively. Shaded regions indicate the periods during which the source was in the 
rising hard state (white), the hard-to-soft transition state (light purple), the soft state (gray), and the 
flare (orange), the latter of which includes the soft-to-hard transition state and the decaying hard 
state. Error bars, which are too small to be visible for most data, indicate the lo confidence interval. 
[Modified from (19), which is distributed under a Creative Commons Attribution License 4.0 (CC BY) 
https://creativecommons.org/licenses/by-nc/4.0/] 
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In the hard state, hot gas near the BH forms a 
corona—modeled as an advection-dominated 
accretion flow (ADAF)—in which a large frac- 
tion of the viscously released gravitational en- 
ergy is advected along the flow and enters the 
BH event horizon (73). Hard states are observed 
during the initial rise and subsequent decay of 
an outburst, which we refer to as the rising and 
decaying hard states. The soft-to-hard state tran- 
sition and the decaying hard state are accom- 
panied by a hard x-ray flare. These flares have 
been detected in optical, infrared, and radio 
bands, which is thought to relate with the 
formation of a jet (14, 15). However, the phys- 
ical mechanisms and processes are not fully 
understood. Hard x-rays during the flare are 
usually interpreted as inverse Compton scat- 
tering in the jet and/or in the ADAF (1D. 


Outburst of MAXI J1820+070 in 2018 


MAXI J1820+070 (also known as ASASSN-18ey) 
is an x-ray binary that consists of a low-mass star 
orbiting a stellar-mass BH (with BH mass 
My ~ 8.5Mo) with a period of 16.5 hours (6). 
It is located at a distance of 2.96 + 0.33 kpc 
(17). The system was identified during an x-ray 
outburst that occurred on 11 March 2018 (18); 
the outburst was observed by several x-ray 
telescopes (19-22). We analyzed archival ob- 
servations of MAXI J1820+070 by Insight 
Hard X-ray Modulation Telescope [Insight- 
HXMT (23)]. The Insight-HXMT data cover 
a broad energy band of 1 to 250 keV and 
were observed between 14 March 2018 and 
21 October 2018 [corresponding to modified 
Julian dates (MJDs) 58191 to 58412] (19, 20). 

The outburst of MAXI J1820+070 also ap- 
peared as an optically bright transient (24), 
which was monitored by the American Asso- 
ciation of Variable Star Observers (AAVSO), 
who produced a light curve (brightness as a 
function of time) that spanned from 12 March 
2018 to 21 December 2018 (MJD 58188 to 59184) 
(25). Radio emission from MAXI J1820+070 
was detected by multiple telescopes, which 
indicated the presence of a jet. We used the 
radio flux by Arcminute Microkelvin Imager 
Large Array (AMI-LA) from (26). 

These datasets provide high-cadence monitor- 
ing of the 2018 outburst of MAXI J1820+070 
over a broad range of wavelengths. The prox- 
imity of the source and its low extinction with 
color excess [E(B-V)] of 0.20 to 0.26 (27) make 
it sufficiently bright to investigate the accre- 
tion process of the magnetized flow near a BH. 

Figure 1 shows the Insight-HXMT light curves 
from the low-energy (LE, 1 to 10 keV), medium- 
energy (ME, 10 to 30 keV), and high-energy 
(HE, 27 to 150 keV) instruments between MJD 
58191 and 58412. We focused our analysis on the 
radio, optical, and x-ray observations of MAXI 
J1820+070 during the hard x-ray flare, including 
the soft-to-hard state transition and decaying 
hard state (MJD 58380-58412). 
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Fig. 2. Multiwavelength light curves of the flare. A distance of d = 2.96 kpc (17) was adopted in 
estimating the observed luminosity. (A) X-ray luminosity from the thermal emission of the thin disk (blue), 
the Compton-scattered emission from the ADAF (orange), and the total (black), integrated over energies 
0.1 to 100 keV as determined by our spectral fitting of the Insight-HXMT data (28). (B) The radio emission 
at 15.5 GHz measured by AMI-LA (26). (C) Extinction-corrected V-band luminosities from AAVSO (gray 
points) and an exponential model (cyan line) fitted to the data between MJD 58380 to 58390. Orange points 
represent the data after subtraction of the exponential model. Error bars, which are too small to be visible 


for most data, are lo confidence interval. 


Lags between flares at different 
wavelengths 

Figure 2A shows the x-ray luminosity of MAXI 
J1820+070 during the flare, which we deter- 
mined from the Insight-HXMT data (28). We 
fitted spectral models to these data to separate 
the luminosities of the accretion disk and 
Compton-scattered emission components, which 
are also shown. A radio flare at 15.5 GHz from 
AMI-LA is presented in Fig. 2B. The optical 
(V-band) data from AAVSO are shown in Fig. 
2C, which also exhibits a flare superimposed 
on the exponential decay of the underlying 
outburst. 
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We performed an interpolated cross-correlation 
function (ICCF) analysis (28) to determine the 
time lag between the radio and x-ray light curves 
and found a radio lag of 8.06*}:s8 days (fig. S1, 
uncertainties are 1c). ICCF analysis comparing 
the radio and the V band shows an optical lag of 
8.617}°6 days with respect to the radio, which 
indicates negligible contribution from the jet 
to the V-band flux (supplementary materials). 
ICCF analysis comparing the x-ray and the V 
band shows an optical lag of 17.6119 28 days with 
respect to the x-ray (fig. S1). This lag is far longer 
than the time taken for light to travel across the 
binary system, which is several seconds for an 
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A Onset of the flare, 
on ¢, ~ MJD 58380 


BH 


B Peak of hard X-ray emission, 
on t.~ MJD 58389 


C Peak of radio emission when a MAD is formed in the inner region of the ADAF, 


on t~ MJD 58397 ] Radio jet 
BH 


m 


Radio jet 


thin disk 


thin disk 


Fig. 3. Schematic diagram of our proposed interpretation. Purple arrows indicate the magnetic field 
lines. The outer thin disk is truncated at a radius Rj, within which an ADAF forms. (A) The configuration 
at the onset of the flare. We assume that the thin disk extends to ISCO and the ADAF expands with increasing 
R,,. (B) Configuration during the peak hard x-ray emission, when the outer thin disk brings both mass 

and magnetic field into the inner ADAF. (€) Configuration at the time of peak radio emission, when the 
magnetic field has been amplified by the ADAF. A MAD forms within the magnetospheric radius R,,, where the 
magnetic pressure equals the gravitational force (4). Movie Sl shows an animated version of this figure. 


orbital period of 16.5 hours; thus, the lag cannot 
be due to reprocessed x-ray illumination of the 
outer disk or the companion star. We used these 
cross-correlations to investigate the origins of 
the emission at each wavelength (28). 


Interpretation of the radio time lag 


Hard x-ray emission during the flare could be 
due to soft x-ray photons inverse Compton- 
scattered by hot electrons (Comptonization) 
within the jet or the ADAF. Given that the radio 
emission originated from the jet (26), but the 
radio flare lagged the x-ray by ~8 days, we ruled 
out the possibility that x-rays were emitted by 
the jet. We therefore suggest that the hard x-rays 
were most likely emitted by the ADAF. 

In the hard state of a BHXRB, a geometrically 
thick ADAF is thought to be located close to the 
BH, extending outward and connecting to a thin 
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disk with a truncated inner radius R;, (equal to 
the ADAF radial extent) (29, 30). We assumed 
that Ri: = Risco at to = MJD 58381 when the 
soft state ended, where Risco is the radius of 
the BH’s innermost stable circular orbit (ISCO). 
On the basis of the derived disk luminosity, 
which we estimated from the spectral fitting 
(28), we found that the truncation radius in- 
creased to Ry, = 6.79Rigco When the hard x-ray 
emission peaked at t; = MJD 58389, then to 
Ri = 32.83Risco by the time the radio emis- 
sion peaked at t, = MJD 58397 (28). Therefore, 
R,, increases with decreasing mass-accretion 
rate. In this case, the fraction of the total grav- 
itational power dissipated in the whole ADAF 
increases with R,,, whereas the power in the 
very inner part of ADAF—where the hard x-ray 
is mostly emitted—decreased with decreas- 
ing accretion rate [eq. $5; (28)]. These two 
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mechanisms compete, so a peak in the hard 
x-ray emission occurred when the latter 
mechanism becomes dominant around 4 = 
MJD 58389. 

Previous studies have suggested that a weak 
and coherent external magnetic field can be 
dragged inwards and amplified by an ADAF be- 
cause of its high radial velocity (31, 32). The 
external magnetic field is expected to be more 
strongly amplified by larger ADAFs (33). We 
therefore calculated the advection and diffu- 
sion of magnetic fields in the ADAF of MAXI 
J1820+070 (28). We found that the magnetic 
field B at Rysco was 2 x 10° G at tp ~ MJD 58381 
(the onset of the flare), 4.5 x 10’ G at 4, ~ MJD 
58389 (the peak of the hard x-ray emission), 
reached a maximum at 7 x 10’ G around fy ~ 
MJD 58397, and then declined. The jet power 
increased with the magnetic field strength near 
the BH, so its radio emission also increased until | 
t, = MJD 58397, which explains the observed 
delay of ~8 days with respect to the x-ray emis- 
sion. We estimate that the jet power at the 
radio peak as 10” erg s~!, which is consistent 
with theoretical predictions given the magnetic 
field of 7 x 10’ G (28). This supports our in- 
terpretation that the magnetic field is trans- 
ported and amplified by the expanding ADAF. 


Formation of a magnetically arrested disk 


In our interpretation, the increasingly strong 
magnetic field influences the subsequent accre- 
tion flow because the magnetic pressure acts 
against the gravity of the BH. Our calculations 
of the radius-dependent magnetic fields within 
the ADAF (28) show that the BH gravity still 
dominated over magnetic pressure at the time 
of the hard x-ray peak (¢, = MJD 58389). After 
this point, the magnetic field inside the expand- 
ing ADAF continued to be amplified, eventually 
becoming dominant at the inner edge. This 
caused a MAD to form around the time of the 
radio peak (t. = MJD 58397), when the mag- 
netic pressure became equal to the gravitation- 
al force (fig. S5) (28). 

We conclude that the increase of the trun- 
cated radius during the flare in MAXI J1820 
+070 led to the observed 8-day radio lag relative 
to the hard x-ray emission. A MAD was then 
gradually established in the ADAF as it con- 
tinued to extend radially. Figure 3 and movie S1 
illustrate our proposed scenario for the physical 
processes associated with the x-ray and radio 
flare and the formation of the MAD. 


Interpretation of the optical time lag 


We considered whether the long delay time of 
the optical flare can be produced by a viscously 
heated thin disk. In this scenario, the x-ray 
flare heats up the outer thin disk, reviving the 
thermal-viscous instability in the thin disk, which 
produces a delayed optical emission. We used a 
disk instability code (34) to determine the time- 
dependent evolution of the thin disk (Fig. 4) (28). 
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Fig. 4. Observed V-band flare compared to our simulations. The observed V-band luminosity (gray data 
points) is the same as in Fig. 2C. The curves represent the predictions from our DIM simulation, including 
one with a disk wind (red line) and one without a disk wind (dashed black line). 


We found that a small optical flare is produced, 
starting around MJD 58390 but fading away 
after a few days, which is much shorter than the 
observed lag of ~17 days. Therefore, the standard 
disk instability model (DIM) cannot explain this 
optical lag. 

A thin disk is expected to launch disk winds, 
which have previously been studied by using 
observations and simulations (35, 36). Previous 
work has found that a disk wind was present 
throughout the outburst of MAXI J1820+070 
(37). We therefore postulate that the disk wind 
was launched during the flare. Disk winds 
affect the profile of the outburst light curves of 
BHXRBs (38, 39) because the disk wind re- 
moves angular momentum from the disk, which 
is equivalent to increasing the viscosity. As a 
result, a brighter and more delayed optical peak 
is produced compared with the case of no disk 
wind. Using a simple parametrization of the 
disk wind, we performed additional simulations 
of the DIM (28). We found that the resulting 
V-band flare is consistent with the observed 
light curve, lagging the hard x-rays by more 
than 15 days (Fig. 4). 
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A machine-learning tool to predict substrate-adaptive 
conditions for Pd-catalyzed C-N couplings 


N. lan Rinehart!, Rakesh K. Saunthwal’, Joél Wellauer”, Andrew F. Zahrt!, Lukas Schlemper’, 
Alexander S. Shved', Raphael Bigler2*, Serena Fantasia**, Scott E. Denmark'* 


Machine-learning methods have great potential to accelerate the identification of reaction conditions for 
chemical transformations. A tool that gives substrate-adaptive conditions for palladium (Pd)—catalyzed 
carbon-nitrogen (C-N) couplings is presented. The design and construction of this tool required the 
generation of an experimental dataset that explores a diverse network of reactant pairings across a set 
of reaction conditions. A large scope of C—-N couplings was actively learned by neural network models 
by using a systematic process to design experiments. The models showed good performance in 
experimental validation: Ten products were isolated in more than 85% yield from a range of couplings 
with out-of-sample reactants designed to challenge the models. Importantly, the developed workflow 
continually improves the prediction capability of the tool as the corpus of data grows. 


he strategic value of carbon-nitrogen cou- 

plings makes them important transforma- 

tions in many domains of the chemical 

enterprise. In particular, Buchwald-Hartwig 

(B-H) couplings (/, 2) are among the most 
important C-N bond-forming reactions and 
have revolutionized the practice of modern 
synthetic organic chemistry (3). In this process, 
palladium complexes catalyze the cross-coupling 
of (hetero)aryl electrophiles with various nitro- 
gen nucleophiles. Experimentalists routinely 
identify substrate-specific conditions for new 
B-H couplings. The extensive scope of electro- 
philes and nucleophiles competent in this trans- 
formation required the development of many 
catalysts and conditions to enable successful 
couplings of diverse reaction partners (3, 4). 
Selection of the appropriate palladium ligand 
is particularly important because B-H couplings 
are exceptionally sensitive to changes in ligand 
structure (5). 

Empirical guides with selected examples from 
the literature and heuristics on the basis of re- 
ported couplings are available to help experi- 
mentalists select appropriate ligands and 
conditions for a given coupling (“prior expe- 
rience” in Fig. 1) (6-8). Because these recom- 
mendations are derived from literature data, 
they are limited to previous experience (i.e., 
retrospective). Specifically, Wuitschik’s guide 
(7) recommends the same conditions for all 
heteroaryl bromides, though more granular, 
heteroarene-specific conditions are often re- 
quired. Buchwald’s original user guide rec- 
ommends conditions for a limited range of 
heteroaryl bromides (pyridines, pyrazines, 
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pyrimidines, thiophenes, oxazoles, thiazoles, 
and pyrazoles) on the basis of work from just 
two references (9, 10) and otherwise focuses 
on recommending ligands for specific nucleo- 
phile types. For example, 2-amino oxazoles are 
a challenging class of nucleophile that re- 
quired a specific publication from Buchwald’s 
group (17). More recently, high-throughput ex- 
perimentation has been used to evaluate a 
range of five-membered heteroaryl bromides 
in B-H couplings, and that work highlights the 
difficulty of couplings between five-membered 
heteroaryl bromides and aliphatic heterocy- 
cles (72). Even with these published reports, the 
chemical literature does not come close to de- 
scribing the enormous scope of possible B-H 
reactant pairings; thus, when a new (hetero)aryl 
halide is used, experimentalists must rely on 
intuition. 

The use of B-H couplings often creates a bot- 
tleneck in routine synthesis campaigns in both 
academia and industry. An experimentalist 
begins with a specific chemistry problem: cou- 
pling of a new pair of reactants (Fig. 1). They 
then identify a subset of conditions on the basis 
of prior knowledge, amalgamating recommen- 
dations from the B-H user guides and B-H cheat 
sheets described above (when recommendations 
are available), as well as personal experience, 
intuition, and specific literature precedent. Those 
prior knowledge-based recommendations serve 
as a starting point for an experimental cam- 
paign to survey catalyst-solvent-base com- 
binations for experimental hits. For many 
applications, a broad range of compounds must 
be synthesized in a timely manner, and hits 
are an end point. In practice, an experimen- 
talist will invest time and resources into 
an optimization campaign to fine-tune those 
conditions only if necessary. Of note, the Doyle 
group recently published a machine learning 
(ML)-based tool that accelerates the optimiza- 
tion of yield from a user-defined reaction space 
(“reaction optimizer” in Fig. 1) (13). Our goal 
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ately provides predicted hits for a new L-- 
posed coupling (which could then be optimized 
if necessary), offering more than empirical 
guides and avoiding an experimental cam- 
paign from an empirical approach, thereby 
accelerating the routine application of B-H 
couplings (Fig. 1). This goal is complementary 
to optimization, and we foresee that the com- 
bination of the two could potentially create an 
end-to-end artificial intelligence-driven proc- 
ess like that shown in Fig. 1. 

From an ML standpoint, there are crucial 
differences between an optimizer tool and the 
tool proposed in this work. A visual illustra- 
tion contrasting reaction optimization to a 
tool based on substrate-adaptive models is 
shown in Fig. 2. A three-dimensional plot rep- 
resents a hypothetical reaction space where 
any specific combination of reactant(s) and 
condition(s) produce an unknown yield, and 
the goal of using ML is to use relatively few 
measured yields to predict the rest. After se- 
lecting a specific coupling from all those pos- 
sible [a slice of a hypothetical reaction space 
along the reactant dimension(s); top of Fig. 2], 
an optimizer directs the selection of experi- ‘ 
ments within that slice of reaction space to 
increase yield. Sophisticated optimizers are 
still being developed to make iterative rounds 
of experimentation as efficient as possible (74). * 
An optimizer could, in principle, be used on 
multiple reactants at once, but the reactants 
must have related reactivity trends (i.e., the 
slices in reaction space are close enough that 
similar conditions have similar reactivity). Be- 
cause B-H couplings are so sensitive to reactant 
structure (vide supra), a new slice—even close in 
the reactant dimension(s)—frequently requires 
a fresh start. A complementary approach to 
the optimizer, shown on the right in Fig. 2, « 
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Fig. 1. The goal of this work. Identifying conditions 
that furnish synthetically useful yields for new 
couplings without experimental campaigns. 
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Fig. 2. Defining substrate-adaptive models and contrasting them with ML-assisted optimization 


models. 


addresses this limitation because it involves train- 
ing models on the entire reaction space before- 
hand. Thus, when a new specific coupling is 
selected, those models can immediately lever- 
age prior learning to predict the yield patterns 
of that new reaction without additional ex- 
periments. This approach produces predicted 
hits, which may still need validation, but cir- 
cumvents an experimental campaign. That ap- 
proach requires an appropriate experimental 
dataset that provides sufficient prior experi- 
ence to the substrate-adaptive models. Before 
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embarking on this enterprise, it was prudent 
to evaluate state-of-the-art applications of ML 
to predict the outcome of new B-H coupling 
reactions and related work. 

In 2018, Doyle and co-workers trained ML 
models to predict the yields of B-H couplings 
between one nucleophile and several similar 
electrophiles (15). By the introduction of is- 
oxazole additives (catalyst poisons) (16), the 
authors simulated the process of gathering 
data for many different reactant pairs while 
avoiding the analytical challenge of having 
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many more distinct products. Models effec- 
tively predicted the extent of catalyst poisoning 
by the additives and were tested on out-of- 
sample additives. Importantly, their models 
were not applied to predicting couplings with 
new nucleophiles or electrophiles and thus 
do not address the problem outlined in Fig. 1. 
Recent follow-up work using that same data- 
set is subject to the same inherent limitations 
and does not address the problem outlined 
above (17, 18). 

A year later, Li and Eastgate combined pro- 
prietary and literature B-H data to create clas- 
sification models to predict which ligand to 
use for a specific B-H coupling for the desired 
synthetic route (19). Although the models suc- 
cessfully leveraged prior experiments to make 
predictions, experimental validation is lim- 
ited to reactants that furnish one specific di- 
arylamine product. Wuitschik and co-workers | 
very recently reported their efforts using a 
similar approach to that of Li and Eastgate 
but with a robust experimental validation that 
demonstrated poor model performance (20). 
The authors attribute that to the limitations of 
the dataset and suggest that new approaches 
such as the workflow presented in this work 
are needed for B-H couplings, citing the asso- 
ciated preprint of this work (27). 

In 2022, Zimmerman and co-workers pub- 
lished an interesting approach to transfer 
learning on B-H couplings, wherein models 
were trained on one type of reactant (i.e., benz- 
amide) and used to predict effective condi- 
tions for a new substrate (i.e., pyrazole or an 
aniline) (22). Their goal was to maximize the 
value of limited data from one reaction to as- 
sist in the development of a new reaction, ef- 
fectively learning one slice of a reaction space 
from Fig. 2 and applying that knowledge to 
another. When two slices show positive trans- 
fer, this approach avoids training models on 
the new reaction. In Zimmerman e¢ ail.’s work, 
positive transfer was observed between ani- 
line and benzamide, whereas pyrazole and 
benzamide couplings showed poor transfer. 
That observation supports the premise that 
general conditions are unlikely to exist for B-H 
couplings. 

In avery recent disclosure, Burke, Grzybowski, 
and co-workers reported general conditions 
for Suzuki-Miyaura (S-M) cross-couplings (23). 
Their approach bypasses using predictive 
models that respond to substrate structure, as 
described in Fig. 2. Instead, the goal of general 
conditions is to work across the many slices of 
the reaction space of S-M couplings, and their 
work relies on the assumption that those exist. 
Unfortunately, general conditions are unlike- 
ly to exist for B-H couplings (vide supra). The 
structural variation of diverse nitrogen nu- 
cleophiles and resulting changes in reactivity 
necessitate domain-specific conditions, and 
decades of research have not solved all those 
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challenges. Zimmerman et al.’s work demon- 
strating poor transfer within B-H couplings 
illustrates the distinct reactivity domains of 
B-H coupling reaction space. As a case in point, 
Burke, Grzybowski, and co-workers used their 
workflow on a dataset of B-H couplings (24); 
however, no model predictions or recommended 
general conditions are presented. 

Although also unrelated to B-H coupling re- 
actions, Doyle and co-workers reported suc- 
cessfully modeling of a noncatalytic functional 
group interconversion of carbinols to alkyl 
fluorides (25). The authors collected a dataset 
capable of training ML models to learn the in- 
herent reactivity patterns of many substrates 
for that reaction. Those models then success- 
fully predict substrate-specific conditions for 
new reactants, as illustrated in Fig. 2. Doyle et al’s 
approach is proof of concept for designing a 
prediction tool for substrate-adaptive con- 
ditions. However, by virtue of having a single 
reactant and no catalyst, that transformation 
has a substantially lower complexity. 


Research strategy 


The reactant dimension for B-H coupling reac- 
tion space like that in Fig. 2 actually comprises 
multiple subdimensions, and the conditions 
dimension also captures solvents, bases, and 
catalysts. All those dimensions are indepen- 
dent (nucleophile, electrophile, catalyst, solvent, 
and base), and all affect yield. Importantly, 
each reactant can show different preferences 
with regard to catalysts, solvents, and bases 
(6, 8). As a result, models must learn the pref- 
erences of each reactant and the interaction 
terms between various combinations of them 
and then correctly weigh those to be useful. 
The data used to train such models must ex- 
plore these complex relationships, and no such 
dataset of appropriate complexity exists. As a 
case in point, Schwaller et al. show that mod- 
eling on the US Patent and Trademark Office 
(USPTO) dataset of B-H couplings, which ex- 
plores a broad range of reactants in a noncom- 


Fig. 3. Representative scope of 
nitrogen nucleophiles for the 


binatorial fashion, did not produce predictive 
models (17). A dataset with a similar reactant 
diversity to that of the USPTO dataset is ideal, 
but that exact dataset cannot support models 
complex enough to address the problem out- 
lined in Fig. 1. 

To build such a dataset, the reactant dimen- 
sions must be unbounded so that it is possible 
to continue expanding to new reactant do- 
mains without starting over. Zimmerman et al.’s 
observation of poor transfer and good transfer 
between different reactant domains of B-H 
reaction space indicates that there may be types 
of couplings that can be grouped and learned 
together and others that must be separately 
addressed. A new strategy for dataset design 
that is founded on separating reactant domains 
is proposed. By combining expert knowledge, 
new chemical descriptors, and well-established 
clustering techniques, representative neighbor- 
hoods (subspaces) of the multidimensional B-H 
coupling reaction space could be identified. 
Then, in a subsequent experimental campaign, 
new data in new subspaces could be iterative- 
ly generated, and the applicability domain of 
models expanded when models are updated 
with that new data (vide infra). 

Before running experiments, we had to de- 
fine a starting point for exploring reactant 
dimensions in B-H couplings. The reactivity 
patterns and pitfalls of this important trans- 
formation are immensely complex: Indoles 
form constitutional isomers (8, 26), benzylic 
and cyclic aliphatic amines are prone to an 
unproductive electrophile reduction pathway 
(8), and many heterocycles undergo unproduc- 
tive, palladium-catalyzed C-H activation path- 
ways, to name a few (27). The relative rates of 
these unproductive pathways, catalyst deac- 
tivation, and productive coupling all affect the 
yield and vary from substrate to substrate. For 
a model to predict useful conditions for new 
B-H reactions, it must learn under which cir- 
cumstances these side reactions appear as well 
as the inherent structure-reactivity patterns 
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and then (b) an algorithmic method 
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chemical descriptors developed in 
this work. The structures shown are 
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out of 8 total 


out of 6 total 


for each reactant, catalyst, and condition. Ac- 
cordingly, to make this work as more than just 
a proof of concept, problematic substrates like 
those described above were included. 

Figure 3 shows a representative list of 19 of 
the 50 nitrogen nucleophiles used in this work. 
In a similar manner, 50 (hetero)aryl bromides 
were selected to broadly represent many build- 
ing blocks of interest to pharmaceutical de- 
velopment as well as a range of electronic and 
steric properties [see supplementary mate- 
rials (SM) for the full list]. The scope of both 
Zimmerman et al.’s and Doyle et al.’s work on 
B-H couplings is subsumed within and greatly 
expanded upon by the scope of this study. The 
boxed structures represent the good and poor 
transfer learning, represented by colored ar- 
rows, that was demonstrated by Zimmerman 
and co-workers (22). 

To realize effective couplings across sucha | 
broad range of reactants, many catalysts were 
necessary. Buchwald- and Beller-type phos- 
phine ligands were identified because (i) they 
share a similar backbone structure, and (ii) 
more than 50 such ligands are commercially 
available and were developed to address the 
broad scope of C-N couplings (3, 28). To cap- 
ture that range, 20 ligands were selected to 
represent the crucial ligand dimension of the 
reaction space with a combination of algorith- 
mic selection and expert knowledge (see SM for 
details). In the interest of practical applicabil- 
ity, models were trained to make predictions 
for solvents and bases used at the bench. Thus, 
two inorganic bases—potassium carbonate and 
sodium tert-butoxide—and one organic base— 
1,8-diazabicyclo[5.4.0]undec-7-ene (DBU)—were 
selected. Finally, 1,4-dioxane, toluene, and tert- 
amyl alcohol were selected as representative 
solvents because they broadly represent cyclic 
ethers, arenes, and alcoholic solvents regularly 
used in B-H couplings (see SM for selection of 
catalyst and conditions). Thus, the reaction 
space for this study—with nucleophile, electro- 
phile, ligand, solvent, and base dimensions— 
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contains 450,000 possible reactions from 180 
conditions (3 bases times 3 solvents times 20 
ligands) and 2500 reactant pairs (50 amines 
times 50 bromides). As mentioned, this space 
already represents an intractable number of 
experiments for combinatorial experimentation. 

The first experiments were designed to rep- 
resent the reaction space broadly with 23 dif- 
ferent algorithmically selected reactant pairs 
and a systematically varied set of conditions 
for each (see SM). Extensive experimental de- 
velopment identified reproducible conditions 
on the 0.5-mmol scale in a 24-tube parallel re- 
actor. Twenty-four conditions were evaluated 
for each reactant pair out of the 180 possible 
(20 catalysts times 3 solvents times 3 bases). 
This approach balanced the need to rapidly 
explore new reactant pairings (i.e., slices from 
Fig. 2) with the need to explore enough condi- 
tions to learn that slice of the reaction space 
(i.e., points on each slice from Fig. 2). The data 
showed 63% of experiments with 0% yield, 
and 82% with less than 20% yield. The paucity 
of hits from which models needed to learn was 
a problem. To generate a higher fraction of 
hits in the data, a new strategy was needed for 
evaluating the condition component of the 
B-H reaction space. 

To increase the number of positive hits, ML 
models were trained to recognize zero and 
nonzero yield patterns. Deep feed-forward neu- 
ral networks were trained to classify reactions 
as zero- or nonzero-yielding using that first 
dataset and showed an average accuracy of 87%. 
Although unable to distinguish between a 1%- 
yielding and a 99%-yielding reaction, such a 
classifier could still increase the number of 
nonzero-yielding reactions that are run. Eigh- 
teen reactant pairs from the first 23 gave sig- 
nificantly more nonzero data (the rest were 
hypothesized to be chemically challenging using 
expert knowledge). The initial results from those 
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18 reactant pairs showed 56% zero-yielding 
reactions, and only 12% with a >80% yield. 
Twenty-four new conditions were selected from 
the 156 remaining conditions (180 total, with 
24 already evaluated) using the classifier to 
predict nonzero-yielding conditions. The new 
results from those 24 classifier-selected con- 
ditions contained just 22% zero-yielding reac- 
tions, and 29% of the data showed >80% yield 
across all 18 reactant pairs (see SM for details). 
Thus, even a limited binary classifier could 
be used to improve the yield distribution of 
new data. 

Those first models allowed us to connect 
steps 1 to 4 in a new workflow depicted in Fig. 4. 
This workflow begins and ends with the ex- 
perimentalist. Thus, (i) a new reactant pair is 
selected by the experimentalist, (ii) the tool 
calculates the corresponding chemical descrip- 
tors, (iii) the tool then uses models to predict 
the yield of all 180 conditions, and (iv) the 
experimentalist can decide which conditions 
to evaluate on the basis of both the predictions 
and their expert knowledge. The second half 
of the workflow, steps 5 to 8, describes the 
process of domain expansion by (v) including 
new data, (vi) retraining models with that new 
data, (vii) testing those models in control ex- 
periments, and (viii) having the experimental- 
ist evaluate model performance. At this point 
in the cycle, the experimentalist again inter- 
venes with expert knowledge and their evalu- 
ation of model performance to select the next 
reactant pair to target. 

To build the desired dataset, the experimen- 
talist directs what the model learns by their 
selection of the next reactant pairs. This pro- 
cess relies on defined reaction subspaces to tar- 
get for domain expansion (vide supra). Figure 
5A depicts the map of reaction subspaces used 
to select the next reactant pairs, which was 
constructed from the output of the process in 
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Fig. 4. New, experimentalist-driven, active-learning workflow for exploration of reaction space. 
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Fig. 3. The B-H reactant space map is an array 
with amines organized by cluster on the ver- 
tical axis and bromides organized in the same 
manner on the horizontal axis. Each small 
square represents a coupling between a spe- 
cific bromide and a specific amine; each rec- 
tangular subdomain therefore represents a 
set of couplings between a cluster of amines 
and acluster of bromides. The varying size of 
the clusters is a consequence of the reactants 
selected in Fig. 3A. The entire array is orga- 
nized through the lens of new chemical de- 
scriptors. The new descriptors combine a radial 
distribution of atoms within each reactant 
around the reaction center with various atomic 
properties relevant to reactivity, producing 
what we call a radial distribution function (see 
SM). The initial algorithmically selected re- 
actant pairings that were used to build the 
first classifier model are shown in light green. 
The workflow in Fig. 4 was then used, and re- 
actant selections were made in an approach 
similar to the transfer learning conducted by 
Zimmerman and co-workers but on a larger 
scale; many such transfer steps were made 
to expand the domain of models (by moving 
to a new subspace). Then, for each new sub- 
space, multiple reactant pairs were evaluated 
to enable learning of substrate-level trends, 
making models robust in that subspace. The 
results of those many selections are shown 
in dark green. 

The dataset can be described as a network, 
and the goal of this work is to explore enough 
connections (reactant pairings) to make infer- 
ences about the missing connections that are 
possible. To visualize this, a structured chord 
diagram is depicted in Fig. 5B, showing amine 
and bromide nodes on either side with edges 
connecting reactants that were coupled in the 
dataset. Similar to the map in Fig. 5A, reac- 
tants are organized by cluster on either side, 
and the edge bundling in the figure illustrates 
that the data are distributed across exemplars 
from each cluster. The first visualization em- 
phasizes that 121 out of the possible 2500 com- 
binations of substrates were evaluated in this 
work. The sparsity of these data is a feature, 
not a flaw; identifying 121 out of 2500 couplings 
and 24 out of 180 conditions to evaluate re- 
duced the experimental burden from 450,000 
experiments (180 conditions and 2500 sub- 
strate pairs) to about 3300 experiments. The 
diversity of the dataset is best represented 
by the reactants evaluated (see SM for the full 
list). Using randomly partitioned data, models 
achieved a mean absolute error (MAE) of 9% 
in an external test set. However, the problem 
outlined in Fig. 1 is best represented by a test 
set of out-of-sample reactant pairs, which is a 
more difficult test. 

After evaluating multiple couplings in each 
subspace, models predicted reactivity trends (but 
not necessarily exact yields) for new reactants 
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Fig. 5. Visualizations of B-H reaction space reactant component. (A) Dataset distributed across reactant 
clusters in a two-dimensional grid. (B) A structured chord diagram showing the network connectivity of the 
reactant pairs sampled in the dataset. One exemplar is shown from each cluster. 


with the 180 conditions. For example, the 
highest-yielding predictions may be about 
50% and the lowest about 0%, whereas the 
experimental data might range from 85 to 0%. 
However, the linear correlation of predicted 
and observed yields would be high, with a 
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slope in the range of 0.5 to 0.75. Once models 
could predict reactivity trends, much more data 
was required to achieve accurate yields. Ulti- 
mately, the trade-off between accurate pre- 
dictions and solving the problem outlined in 
Fig. 1 required that we accept this as an ex- 
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perimental limitation and continue domain 
expansion before reaching yield accuracy in 
every subspace. Although models sometimes 
showed limited accuracy, they were still able 
to address the key problem because they could 
identify the yield trends for the various reac- 
tion conditions out of the 180 available. Ac- 
cepting this limitation, a pragmatic, ordinal 
ranking of predictions was used. Predictions 
were divided into quartiles (25% of the total 
predicted range), and these quartiles were 
labeled as “high,” “moderate,” “low,” and “poor.” 
In practice, multiclass classification models (in- 
stead of regression models with this ordinal 
ranking of predictions) were slightly less per- 
formant. To test this system as a tool, models 
trained on the resulting dataset were experi- 
mentally evaluated with new B-H couplings. 


Experimental validation 


To evaluate the performance of these models, 
the tool was tested in a typical use case: New 
couplings were selected in which one or both 
reactants were not seen by the model (out-of- 
sample) and were tested using an experimen- 
tally tractable number of conditions (one to 
five in most cases) with the highest predicted 
yields. To ensure that this test was rigorous, the 
number of catalysts evaluated for each cou- 
pling was limited to 3 out of the 20 available. 
This precaution minimized the possibility that 
a selected catalyst would work well by chance 
instead of models learning meaningful chem- 
ical structure and reactivity relationships. When 
the experiments were carried out, predictions 
were ranked into quartiles, and, as they were 
available, a few conditions in the top quartile(s) 
were evaluated. A complementary set of con- 
ditions predicted to give low yields were also 
evaluated to test whether the models were 
correctly learning reactivity trends as described 
above. Together, conditions with high and 
low predicted yields bookend a representative 
experimental yield range for each coupling (see 
SM for details). Figure 6 presents the results of - 
experimental validation, including (i) a chem- 
ical structure indicating out-of-sample reactants 
in red, in-sample reactants in blue, and the 
coupling locus highlighted by the bold bond; 
Gi) the number of experiments evaluating dif- 
ferent conditions; (iii) the subset of conditions 
with a high predicted yield (hits) that were 
tested, represented as circles; (iv) an indication 
of success by the coloring of the circle (green, 
experimental yield in the top quartile), failure 
(red, experimental yield not in the top quar- 
tile), or a special case (gray); (v) the highest 
isolated yield obtained from those predicted 
hits; and (vi) a heatmap showing all 180 pre- 
dicted yields for use as a visual fingerprint of 
the response of the models to changing re- 
actant structure. 

To interpret experimental validation re- 
sults, success and failure must be defined. The 
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Fig. 6. Experimental validation 
of substrate-adaptive models 
as condition recommenders. 
Out-of-sample (red) and in-sample 
(blue) reactant fragments are indi- 
cated for all products. Circular 
iconography indicates the number 
of predicted hits that were tested; 
green indicates a success, and 

ed indicates a failure. Gray color is 
explained in the text. The highest 
isolated yield is indicated below 
the circular icons. The prediction 
heatmaps share the legend 
shown for compound m. The 
scale indicates the highest yield 
prediction for each coupling 
(middle number) and color 
scale. Cy, cyclohexyl; Ph, phenyl; 
THP, tetrahydropyranyl. 
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hypothesis behind the experimental validation 
is that models can correctly recommend syn- 
thetically useful hits for couplings with one 
or more new reactants by predicting reactivity 
trends. A successful experiment was defined as 
one in which the predicted hit furnished a 
yield in the top 25% of those observed for that 
coupling (a top quartile yield). For practical 
reasons, not all 2340 experiments (13 reactant 
pairs in Fig. 6 times 180 conditions) were eval- 
uated; instead, 187 experiments (both hits and 
zero-yield predictions) were performed across 
all 13 reactant pairs. Note that although this 
experimental design does not test whether all 
high-yielding conditions were identified, in 
10 out of 13 cases, the highest observed yields 
were >85%, and all three of the remaining 
cases were expected to furnish a lower yield 
(vide infra). The full details of predicted and 
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observed yields, correlations between them, 
and common error metrics are provided in 
the SM. 

Our claim that yield trends are sufficient to 
gauge performance is illustrated by the first 
five validation experiments in Fig. 6. The cou- 
plings to form the products a to e showed very 
good predicted-observed correlations [0.93, 
0.89, 0.91, 0.77, and 0.75 coefficient of deter- 
mination (R”); see SM]. Poor residual errors 
[>25% MAE, >34% root-mean-square error 
(RMSE)] were observed for a, ec, and e be- 
cause the highest yield predictions of only 
33, 49, and 57% led to the highest observed 
yields of 98 to 99%. Moderate-to-good residual 
errors were observed for b (14% MAE, 18% 
RMSE) and d (9% MAE, 18% RMSE). Couplings 
ato cshare the same nucleophile, and the best 
conditions changed among those couplings, 
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demonstrating that the models adapted to elec- 
trophile structure. 

Next, the coupling of azepane with a protected 
2-bromobenzyl alcohol derivative (product f) 
used two out-of-sample reactants. The results 
showed very good accuracy, with 4% MAE and 
7% RMSE and a good correlation of 0.80 R? 
between predicted and observed yields. By 
contrast, the coupling of piperidine with 3- 
bromo-5-methylpyridine (product g) involves 
two in-sample reactants (the only example here 
of coupling two in-sample reactants) but the 
combination was never tested in the dataset. 
The results were satisfying, with an accuracy 
of 12% MAE and 16% RMSE and a correla- 
tion of 0.86 R®. Comparing prediction finger- 
prints of f and g provides some insight into 
the models’ ability to adapt predictions to 
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substrate structures: For the more sterically 
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hindered bromide in coupling f, the less bulky 
diarylphosphino-substituted ligand 4 (CPhos 
hybrid) is predicted to be the best. For the un- 
hindered bromide in coupling g, bulkier tri- 
cyclohexylphosphine-substituted ligands 17, 
20, and 21 (CPhos, RuPhos, and SPhos) are pre- 
dicted to be superior. 

In the coupling of indole and 2-bromothiazole 
(product h), the bromide was new to the mod- 
el. This simple coupling is reported using copper 
catalysis only (29-32) and is a difficult cou- 
pling for palladium catalysis. As a result, this 
experiment tests the ability of the model to 
identify couplings that will not work. Indeed, 
the models predicted a maximum 23% yield, 
with mostly near- and zero-yield predictions. 
Experimental evaluation of the five-highest 
predictions showed that one condition pro- 
vided a 25% yield, and the rest furnished no 
product. Within the confines of this experi- 
mental design, it appears that the model suc- 
cessfully identified a challenging coupling; 
however, to fully substantiate that claim, all 
180 conditions would need to be evaluated. 
An experimentalist with those predictions 
should pursue another synthetic method if it 
is available. 

By contrast, the coupling of tert-butyl pyro- 
glutamate with 2-(3-bromopheny])-1,3-dioxolane 
(both of which are out-of-sample, product 1) 
represents an extrapolation because the pyro- 
glutamate (i.e., contains an ester group) is 
substantially different from anything in the 
dataset. As was seen with product h, the four 
top predictions ranging from 59 to 68% yield 
were evaluated, yet the highest observed yield 
was 17%, a clear failure on the part of the models 
to recognize a challenging, extrapolative cou- 
pling. We observed that the nucleophile was 
converted to the carboxylate in situ by “H quan- 
titative nuclear magnetic resonance (qNMR) 
and hypothesized that this by-product pre- 
vented coupling by coordinating Pd. Those 
complications are excellent examples of real 
challenges in generalizing this reaction that 
prevent the tool from accurately identifying a 
low-yielding reaction because it had not learned 
to predict such complicated behavior. 

2,6-Dimethylaniline derivatives represent a 
class of nucleophile that is new to the models 
because all other primary (hetero)aromatic 
amines included in the dataset have no or- 
tho substitution. Twenty-four conditions were 
evaluated for three separate pairings of 2,6- 
dimethylanilines with one out-of-sample bro- 
mide (4-phenoxybromobenzene, product i) 
and two in-sample bromides (3-bromoquinoline 
and 3-bromonitrobenzene, products j and k). 
The fractions of top-quartile predictions that 
furnished yields in the top quartile of those 
observed were 7/7, 7/9, and 7/9, respectively. 
That i to k represent extrapolations is more 
evident in their range of poor to moderate R” 
of 0.70, 0.05, and 0.40, respectively. The ob- 
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served yield ranges for the best predictions 
evaluated were 76 to 99%, 83 to 99%, and 77 
to 89% yields, respectively. These results sug- 
gest that for a subclass of anilines not repres- 
ented in the dataset, models were still able to 
identify high-yielding conditions. Thus, a trac- 
table number of experiments (7 to 9 out of the 
24 evaluated) were predicted to furnish good 
yields and seven did for each of the three 
couplings. 

Finally, 5-fluoroskatole was included as a 
nucleophile for its similarity to indole but its 
inability to undergo C(3) arylation (product 
m). While acquiring the dataset, we regularly 
observed the C(3) arylation products—often as 
dominant species—effectively forcing models 
to predict C-N couplings that are in competi- 
tion with C(8) aryl coupling. As chemists with 
an understanding of reaction mechanisms, it 
is intuitive that a minor structural perturba- 
tion such as adding a methyl group to the C(3) 
position of indole will prevent competitive C(3)- 
arylation. The models being evaluated do not 
learn mechanisms or the implications of such 
a minor structural change (and corresponding- 
ly minor change in the descriptors) as adding 
a methyl group to that position of an indole. 
The experimental validation results show a poor 
correlation between predicted and observed 
yields and stochastic errors of predicting zero- 
and nonzero-yielding conditions, suggesting a 
different observed reactivity pattern than what 
the models had learned. Despite this, out of 
the four conditions with the highest predicted 
yield (40 to 55%), two provided good yields of 
84 and 85%. This example illustrates that the 
tool can still be useful, even on a challenging 
coupling that represents a substantial mecha- 
nistic extrapolation. 

Taken together, the results of the validation 
experiments demonstrate that the performance 
of the model exists on a gradient. For new re- 
actants from a reaction subspace that is well 
represented in the dataset, predictions are ro- 
bust (i.e., a to e and g to f). For cases in which 
reactants represent structural permutations 
from those in the dataset (i.e., d and e), the 
models correctly learned reactivity trends and 
could predict hits. For new types of structures 
that may have different reactivity patterns 
than those in the dataset, performance ranged 
from moderate (i to k) to poor (i and m). 
However, even the lowest model performance 
demonstrated here provided good yields (de- 
pending on the chemical limitations of the 
coupling in question, e.g., 1). Most importantly, 
poor model performance can be rescued by 
domain expansion of the dataset (see fig. S22). 
This approach to network exploration using 
active learning has enabled us to create mod- 
els that are useful over a broad applicability 
domain by evaluating only 0.7% of the reaction 
space. Although we will continue to explore this 
space, more importantly, we have now created 
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a transportable blueprint for domain expan- 
sion of a dataset. 


Outlook 


The dataset described herein comprises >120 
reactant pairs that systematically explore a 
microcosm of B-H coupling space. Models 
trained on these data simultaneously learned 
nonlinear reactivity trends for many different 
classes of reactants. These models could pre- 
dict the yield of reactions with a mean absolute 
error of 9% using randomly partitioned data 
and were performant at reactant generaliza- 
tion, as demonstrated by out-of-sample sub- 
strate validation. 

Key to achieving this goal was an informatics- 
guided strategy that reduced the experimental 
impossibility of exploring a 450,000-member 
reaction space to an experimentally tractable 
problem of acquiring a dataset comprising only 
3300 experiments. We present both this val- 
idated tool for Pd-catalyzed C-N couplings as 
well as an active-learning workflow, which, un- 
like prior work, was used to build an expansive 
dataset for the chemical community. The chem- 
istry community can engage with this work on 
four different levels. An experimentalist with 
no interest in ML can use the snapshot of the 
tool presented in this work without expertise 
in ML or programming and expect performance 
similar to experimental validation. We invite 
any practitioner with an interest in ML to take 
the tool and resume the workflow in Fig. 4, 
honing the tool to new reactant domains of in- 
terest or steadily improving prediction accuracy 
on existing dataset domains. Furthermore, we 
invite any practitioner with expertise in ML to 
use the new active-learning framework on other 
important reactions with expansive multire- 
actant spaces. Finally, we offer this dataset for 
focused development on modeling noncombi- 
natorial, diverse datasets, which are rare in the 
chemistry domain. 
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Enhanced warm, salty subarctic inflows drive high-latitude atlantification, which weakens oceanic 
stratification, amplifies heat fluxes, and reduces sea ice. In this work, we show that the atmospheric 
Arctic Dipole (AD) associated with anticyclonic winds over North America and cyclonic winds over 
Eurasia modulates inflows from the North Atlantic across the Nordic Seas. The alternating AD phases 
create a “switchgear mechanism.” From 2007 to 2021, this switchgear mechanism weakened northward 
inflows and enhanced sea-ice export across Fram Strait and increased inflows throughout the Barents 
Sea. By favoring stronger Arctic Ocean circulation, transferring freshwater into the Amerasian Basin, 
boosting stratification, and lowering oceanic heat fluxes there after 2007, AD+ contributed to slowing 
sea-ice loss. A transition to an AD- phase may accelerate the Arctic sea-ice decline, which would further 


change the Arctic climate system. 


he Arctic region is rightfully called a fron- 

tier for global climate change. Linked to 

atmospheric circulation, radiative forc- 

ing, and a host of climate feedback mech- 

anisms, Arctic surface air temperatures 
are rising at least three times faster than global- 
average air temperatures (1). The Arctic Ocean 
is warming faster than the global ocean (2). 
Sea-ice decline is a true indicator of climate 
change, affecting all aspects of life in the north- 
ern high-latitude regions (7). One of the rea- 
sons for sea-ice loss is the warming Arctic 
Ocean, which is caused in part by anomalous 
inflows from the North Atlantic and North 
Pacific (3-5). System-wide changes in Arctic 
basins caused by anomalous inflows from 
the Nordic Seas are referred to as atlantifi- 
cation (5-7). One of the many manifestations 
of atlantification in the Eurasian Basin of the 
Arctic Ocean is decreased upper-ocean strat- 
ification and enhanced heat release from the 
warm intermediate (150- to 800-m depth) 
Atlantic water layer, which results in accel- 
erated loss of sea ice (5, 8, 9). However, these 
changes are complex, and their driving forces 
and interactions with the Arctic atmosphere- 
ice-ocean system are not well understood. 
This study identifies important mechanisms 
that steer high-latitude atlantification, which 


‘nternational Arctic Research Center and College of Natural 
Science and Mathematics, University of Alaska Fairbanks, 
Fairbanks, AK 99775, USA. “Institute of Marine Research, 
Bergen, Norway. “International Arctic Research Center, 
University of Alaska Fairbanks, Fairbanks, AK 99775, USA. 
4Geophysical Institute and College of Natural Science and 
Mathematics, University of Alaska Fairbanks, Fairbanks, AK 
99775, USA. °Woodwell Climate Research Center, Falmouth, 
MA 02540, USA. °Alfred-Wegener-Institute, Helmholtz Centre 
for Polar and Marine Research, D-27570 Bremerhaven, 
Germany. ’Polar Science Center, Applied Physics Laboratory, 
University of Washington, Seattle, WA 98105, USA. ®Bjerknes 
Centre for Climate Research, Bergen, Norway. 
*Corresponding author. Email: ivpolyakov@alaska.edu 


Polyakov et al., Science 381, 972-979 (2023) 


inform a broad and comprehensive under- 
standing of system function. 


Sea-ice changes 


Although the end-of-summer ice extent and 
thickness are declining, our results indicate that 
the rate of decline slowed after 2007 compared 
with 1992 to 2006 (Fig. 1, A and B). Over the 
satellite record, the trend in summer ice extent 
during 2007 to 2021 (—0.07 + 0.18 x 10° km? per 
decade) is weak and not statistically significant 
in contrast to the much larger negative trends 
in 1979 to 2021 (-0.79 + 0.13 x 10° km? per 
decade) and 1992 to 2006 (-0.99 + 0.51 x 10° km? 
per decade). Thus, a more stable regime of Arctic 
sea ice appears to have begun in 2007. This 
transition was abrupt, with 2007 setting a record 
for a single-year sea-ice-extent decrease of —-1.6 x 
10° km? (compare with 2012’s second record- 
year drop of -1.0 x 10° km?) (10). 

Similarly, the composite record of mean winter 
(February to March) ice thickness in the cen- 
tral Arctic, now close to ~2 m, has not changed 
markedly since 2007 (Fig. 1A), even though the 
multidecadal decline has been significant. In 
addition, the mean thickness in fall (October 
to November) has remained above the 1-m low 
established after the end of the summer of 2007. 
Between 2003 and 2007, the thinning was 
notable and occurred with the loss of a large 
fraction of thick multiyear sea ice (11). The 
overall thinning has slowed since 2007, when 
satellite-based records of ice thickness began 
(ICESat, ICESat-2, and CryoSat-2, 2003 to 2021). 
The Arctic is now dominated by the behavior 
of thinner seasonal ice, which now solely con- 
trols the variability of ice thickness in the cen- 
tral Arctic (72, 13). 


Atmospheric changes 


The atmosphere over the Arctic Ocean is dom- 
inated by high pressure (known as the Polar 
High) centered over the western Arctic (Fig. 
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(clockwise) circulation. This drives predomil.-- 
features of sea-ice drift and upper-ocean cir- 
culation known as the Beaufort Gyre in the 
Amerasian Basin as well as the Transpolar 
Drift flowing from the Siberian shelf toward the 
Canadian Archipelago and Fram Strait (Fig. 1G). 

The primary mode of variability of the pan- 
Arctic sea level pressure is known as the Arctic 
Oscillation, and the related wind pattern ac- 
counts for the observed climatological features 
of the atmospheric circulation. Beginning in 
2007, however, the secondary Arctic Dipole 
(AD) pattern, which features higher sea lev- 
el pressure over the Beaufort Gyre and the 
Canadian Archipelago along with lower sea 
level pressure over the Siberian Arctic, became 
dominant [Fig. 2C; see also (14-16)], whereas 
the Arctic Oscillation remained close to neu- 
tral (fig. S4). This shift is evident in the higher | 
spatial correlation between the mean 2007 
to 2021 sea level pressure and the AD (R = 0.59, 
where R is the correlation coefficient), com- 
pared to that with the Arctic Oscillation (R = 0.47). 

The AD index, a characteristic of the wind 
cyclonicity in the central and Siberian Arctic 
(14), varies between positive (AD+) and nega- 
tive (AD-) over ~15-year regimes (see wavelets 
in figs. S2 and S3). During 1992 to 2006, both 
the AD and Arctic Oscillation spring-summer 
indices were slightly negative (fig. S4, D and E), 
whereas during 2007 to 2021, the AD index be- 
came increasingly positive (fig. S4E). 

The AD+ drives an enhanced anticyclonic 
Beaufort Gyre and Transpolar Drift (Fig. 2) (77). 
Distinct from the Arctic Oscillation (18, 19), 
the across-pole AD pattern results in increased 
heat advection into the Arctic, especially along 
the Siberian shores (Fig. 2B), and contributes 
to higher surface air temperatures (fig. S3). The 
AD was a major driver of the second record- 
low sea-ice extent in summer 2007 during the 
satellite record (20). The Fram Strait sea-ice 
export is correlated with the AD index during 
1979 to 2014 [significant R = 0.45, (17); see Fig. 
2D], with a stronger link to the AD than to the 
Arctic Oscillation (27). 

Most important for this study, the alternat- 
ing AD phases were pivotal for a switchgear 
mechanism that modulates the relative strength 
of the Fram Strait and Barents Sea branches 
transporting Atlantic water into the Arctic Ocean. 
For example, the anomalous atmospheric forcing 
during the AD+ in 2007 to 2021 was favorable for 
reduced flows into the Arctic through the Fram 
Strait along with enhanced inflows through the 
Barents Sea Opening (Fig. 2, B, E, and F). The 
Arctic Oscillation pattern contributes to large- 
scale cyclonicity in Arctic atmosphere, ocean, 
and ice circulation (22) but not the finer de- 
tails suggested by the switchgear mechanism 
discussed here (fig. S4). The AD-driven forcing 
is best developed in spring and summer (figs. 
S2 and S5) but can affect air temperatures, 
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Fig. 1. Loss of Arctic sea-ice thickness and extent. (A) Arctic sea ice 
thickness changes for autumn (red and dotted red) and winter (blue and dotted 
blue). Shadings (blue and red) show 1 standard error (S.E.) ranges from the 
regression analysis of submarine ice thickness and expected uncertainties in 
satellite ice thickness estimates. Data release area of submarine data ice 
thickness data are shown in the inset. Satellite ice thickness estimates are for 
the Arctic south of 88°N. Thickness estimates from more localized airborne/ 
ground electromagnetic surveys near the North Pole (diamonds) and from 
Operation IceBridge (circles) are shown within the context of the larger scale 
changes in the submarine and satellite records (52). EM, electromagnetic. 

(B) September Northern Hemisphere (NH) sea-ice extent (blue line), the long- 
term trend (green line), and different regimes of sea-ice extent change in 1992 to 
2006 and 2007 to 2021 (red segments); the inset shows available potential 
energy (APE) anomalies in the upper ocean (surface mixed layer and halocline) 
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for 2007 to 2017 relative to 1992 to 2006 (increasing APE signifies stronger 
stratification suppressing mixing). (© and D) Maps of trends for the sea-ice 
concentration (percentage per year) and the summer biweekly (time-integrated) 
TI-NDVI (decade) over Arctic tundra for 1992 to 2006 (C) and 2007 to 2021 
(D). The NDVI is a proxy for vegetation productivity and is derived from remotely 
sensed products. A positive NDVI trend means that the vegetation has more biomass 
and more photosynthetic productivity; this process is called greening. A negative 
NDVI trend means that the vegetation has less biomass and is less healthy, and 

it is called browning. (E and F) September sea-ice extent trends (E) and the 
TI-NDVI (F) for 1992 to 2006 (blue) and 2007 to 2021 (red) as a function 

of longitude. (G) Diagram of sea ice drift and upper ocean circulation (blue arrows), 
as well as Atlantic water circulation (red arrows) (53). FJL-NZ, TPD, and BG 
indicate Franz Joseph Land-Novaya Zemlya pass, TransPolar Drift, and 

Beaufort Gyre, respectively. 
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Fig. 2. Atmospheric forcing governing the switchgear mechanism. (A) Annual sea level pressure 
(SLP)(hPa) averaged over 1992 to 2006. (B) Annual pressure anomalies (hPa, shading) in 2007 

to 2021 relative to 1992 to 2006. Vectors show corresponding anomalous geostrophic winds. (€) April to 
July AD (hPa) pattern, which is correlated with the 2007 to 2021 pressure anomalies at R = 0.59. (D to 

F) Time series of atmospheric parameters. (D) April to July AD (blue) and March to August ice area transport 
across Fram Strait [green, from (19)] are reduced to anomalies by subtracting means (Mn) and normalized 
by SDs; AD and ice export are correlated at R = 0.44. (E and F) Annual wind across Fram Strait (E) and 
Barents Sea Opening (BSO) (F). In (D) to (F), red horizontal segments show 1992 to 2006 and 2007 


to 2021 means. 


surface energy fluxes, storm tracks, sea-ice 
drift and exports, and upper-ocean circulation 
in all seasons (fig. S3). 


Changes in the Arctic Ocean 
Switchgear mechanism between the Fram Strait 
inflow and Barents Sea throughflow 


Water exchanges between the Nordic Seas and 
the Arctic Ocean are critically important for 
the state of the Arctic climate system. Ocean 
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Reanalysis System 5 (ORASS) reanalysis data 
(see the supplementary materials for details) 
suggest that these exchanges in the upper 50 m 
were amplified across the northern Barents Sea 
Opening and in the northern and central 
Barents Sea while being reduced across the 
Fram Strait in the past 15 years (Fig. 3), with 
similar patterns but weaker anomalies in the 
50- to 200-m layer (fig. S6). A time series of 
currents across the Barents Sea Opening and 
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Fram Strait clearly show this alternating pat- 
tern, with 5% increased annual mean currents 
in the Barents Sea Opening and 15% decrease 
in the Fram Strait (Fig. 3, C and D). Summer 
and fall ORAS5 transports were the most not- 
able contributors to anomalous Barents Sea 
Opening inflows (15% increase), whereas spring 
and summer processes dominated Atlantic 
water inflows through the Fram Strait (28% 
decrease in current speed) (fig. S7). At the 95% 
confidence level, each of the aforementioned 
estimates of anomalous transports is statisti- 
cally significant. In comparison to the Barents 
Sea Opening, changes in the upper 50-m water 
volume transports were ~2.2 times as strong 
in the Fram Strait. As a result, changes in trans- 
ports through these gateways do not counter- 
vail, and the importance of other gateways in 
establishing the Arctic Ocean’s water balance 
must not be underestimated. On weekly to _ 
monthly timescales, increased eastward flow 
in the northern Barents Sea and weakened in- 
flow across the Fram Strait are associated with a 
northward shift in atmospheric cyclones over 
the Barents Sea (23). 

Amplified Barents Sea Opening inflows in 
2007 to 2021 resulted in increased transports 
across the Franz Joseph Land-Novaya Zemlya 
pass by 23%, thus providing an enhanced 
inflow from the Barents Sea into the Arctic 
Ocean (Fig. 3G and fig. $8). These changes 
were crucial for the new state of the eastern 
Arctic Ocean brought on by atlantification (5). 
Tracer experiments showed a doubled prob- 
ability that water parcels in the upper 50 m 
crossed the Barents Sea in 2007 to 2021 com- 
pared with 1992 to 2006, in contrast with the 
Fram Strait, where the number of tracers de- 
creased over the same time by a factor of four 
(Fig. 4, D and E). Stronger impacts of the local 
winds on the Barents Sea compared with Fram 
Strait inflows are consistent with stronger to- 
pographic steering and more complex flow in 
the Fram Strait (24, 25). 


These findings are strongly supported by - 


both in situ and satellite observations. For ex- 
ample, mooring and reanalysis records are posi- 
tively correlated and show consistent increasing 
(decreasing) trends of currents across the Barents 
Sea Opening (Fram Strait) over the past 15 years 
(fig. S9). A modest correlation R = 0.34 be- 
tween the Barents Sea Opening mooring and 
reanalysis time series is likely because moor- 
ings do not cover the northern regions where 
the reanalysis shows the greatest increase in 
flow. Moreover, anomalies in the satellite-based 
sea surface height provide further confirmation 
for the switchgear mechanism modulating 
the inflows through the Barents Sea and Fram 
Strait during the current AD+ (Fig. 4G). Geo- 
strophic currents forced by the anomalous 
2007 to 2021 sea surface height were ampli- 
fied in the northern Barents Sea Opening and 
in the central Barents Sea but showed little 


3 of 8 


RESEARCH | RESEARCH ARTICLE 


difference in the southern Barents Sea Open- 
ing. However, the sea surface height indicates 
an anomalous flow from the Lofoten Basin to 
the Barents Sea in 2007 to 2020 (Fig. 4G). The 
possibility of the Lofoten Basin feeding the 
Barents Sea has been suggested by (26), at- 
tributed to the vigorous eddy activity in the 
Lofoten Basin (27, 28), and is further linked to 
the atmospheric wind forcing (29). In contrast 
to the Barents Sea Opening, the geostrophic 
flow across the Fram Strait weakened (Fig. 4G). 


Imprints of alternating AD patterns on the 
Arctic Ocean circulation 


The Arctic basins responded to the AD+ atmo- 
spheric regime in 2007 to 2021 with basin-wide 
changes in the upper Arctic Ocean circulation 
associated with an amplified Beaufort Gyre, 
stronger boundary currents along the Siberian 
slope, and a shifted Transpolar Drift from the 
Amerasian Basin toward the Lomonosov Ridge 
(Fig. 3B and Fig. 4, A and B). 

Since 2007, a smaller but more intense Arctic 
high, associated wind-driven circulation, and 
convergence of the upper Beaufort Gyre have 
resulted in enhanced freshening and thicken- 
ing of the surface fresh layer in the Amerasian 
Basin (22, 30). The Beaufort Gyre mooring 
record provides confirmation of these findings 
(Fig. 5, A and B). This evidence is further sup- 
ported by observed sea-ice melt, redirected 
Siberian riverine waters into the Beaufort Gyre, 
increasing inflow of relatively fresh Pacific Water 
through the Bering Strait, and strengthened strat- 
ification between the Amerasian Basin’s surface 
and deep layers (22, 30-32). At the same time, 
reanalysis and mooring observations showed 
contrasting changes in the Eurasian Basin, with 
increased salinification and weakened stratifica- 
tion in the halocline, along with amplified up- 
ward heat fluxes (Fig. 5, C and D) (5, 33). 

Our tracer experiments support these find- 
ings (Fig. 4, A to C, and fig. S11). For example, 
the 2007 to 2021 cyclonic atmospheric forcing 
widely dispersed transports of freshwater from 
the Siberian shelves (where the Transpolar Drift 
originates) and drove a substantial portion (17% 
of all trajectories) of the Siberian freshwater 
into the Beaufort Gyre. By contrast, during the 
anticyclonic 1992 to 2006 AD- phase, not a 
single water parcel ended up trapped in the 
Beaufort Gyre, and they instead left the central 
Arctic through the straits of the Canadian Ar- 
chipelago. Changes in the spatial distributions 
of meteoric water (i.e., precipitation including 
water from lakes and rivers) provide confir- 
mation of the diversion of freshwater from the 
Eurasian Basin to the Amerasian Basin, with 
meteoric water content decreasing in the 
Eurasian Basin and increasing in the Amer- 
asian Basin, consistent with (34, 35). These 
changes in meteoric water content are accom- 
panied by a general increase in net sea-ice melt- 
water (fig. S12). 
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Fig. 3. Changes in upper 50 m oceanic circulation from ORAS5 reanalysis between alternate AD 
phases. (A and B) Maps of (A) 1992 to 2006 annual mean current speed |U| and (B) 2007 to 2021 |U| 
anomalies (relative to the 1992 to 2006 mean) of the Arctic Ocean and Nordic Seas region with removed 
low-frequency (1/30 years cut-off frequency) components by using running mean filtering. (€ to E) Time 
series of the ocean annual mean currents inflowing into the Arctic through Fram Strait (80°N, 14°W to 10°E) 
(C), the Barents Sea Opening (71°N to 77°N, 20°E) (D), and the FJL-NZ passage (76.7°N to 80.6°N, 
60.5°E to 64.3°E) (E). Red horizontal lines show means over 1992 to 2006 and 2007 to 2021. Transfer 
coefficients from current in centimeters per second to water transport in sverdrups (Sv) (1 Sv = 10° m°/s) 
are given in each time series panel. Note the reduced (enhanced) inflow through Fram Strait and 
enhanced (reduced) Barents Sea throughflow in the years 2007 to 2021 (1992 to 2006). 


In the Eurasian Basin, the local effects of al- 
ternating AD patterns and the remote effects 
of atlantification owing to changing influxes 
across Fram Strait and Barents Sea are inter- 
connected in an ice/ocean heat feedback mech- 
anism. Weakened stratification and increased 
oceanic heat fluxes associated with atlantifica- 
tion drive sea-ice melt, which is amplified by 
increased oceanic heat fluxes through increased 


convective entrainment in winter (5, 9). A stron- 
ger coupling between atmosphere, ice, and ocean 
in the eastern Arctic in the recent decade and 
intensified upper-ocean currents play critical 
roles in developing this feedback (Fig. 3B) (36). 


Changes in the Nordic Seas 


Anomalous anticyclonic winds over the Nordic 
Seas, as evident during the AD+ in 2007 to 2021 
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Fig. 4. Difference of the oceanic circulation patterns between alternate 

AD phases. (A to G) A shift in the Transpolar Drift [(A) to (C)], inflows of Atlantic 
water from the Nordic Seas across the Barents Sea Opening and Fram Strait 
[(D) to (F)], and anomalous (2007 to 2021 minus 1992 to 2006) sea surface 
height (SSH) (cm) and corresponding geostrophic currents (G). (A to C) ORAS5- 
based trajectories of parcels exiting the central Siberian shelf in 1992 to 2006 (A) 
and 2007 to 2021 (B) and the probability of finding a parcel within the polygon 


(indicated with the green line) north of the Canadian Archipelago (C). (D to F) 
ORASS5-based trajectories of parcels released in the Nordic Seas along the 
sections indicated with the black line in 1992 to 2006 (D) and 2007 to 2021 (E) 
and the probability of finding a parcel traveling across the Barents Sea Opening, 
Fram Strait, and FJL-NZ [indicated with the green lines in (D) and (E)]. (G) 
Satellite-based anomalous SSH and geostrophic currents showing the switchgear 
mechanism from Fram Strait to Barents Sea Opening inflow after 2007. 


(Fig. 2B), weakened the poleward Atlantic wa- 
ter flow from south of the subpolar gyre and 
through the Nordic Seas (25, 37). Attributed 
to a weak reinforcement of the subpolar gyre 
(fig. S13) that altered the properties of the 
inflowing Atlantic water (38), the Atlantic 
water inflow from the northern North At- 
lantic became colder and fresher in the late 
2000s (Fig. 5, Q and R) (39). The Atlantic water 


temperatures and salinities in the Nordic Seas 
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and the Barents Sea Opening responded to this 
change by peaking in the late 2000s (Fig. 5, K to P). 

Consistent with these changes, the salinity 
trends in the Nordic Seas after 2006/2007 are 
negative and similar in magnitude from the 
North Atlantic up to the Barents Sea Opening 
(Fig. 5, R, P, N, J, and I). Atmospheric cir- 
culation over the Nordic Seas associated with 
the AD+ in 2007 to 2021 (Fig. 2B) weakened 
the northward Norwegian Atlantic Current 
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(Fig. 3B) (40), which is consistent with a neg- 
ative trend of —-3 cm/s per decade shown by 
the mooring record from the Svingy section, the 
gateway for the Atlantic water into the Nordic 
Seas (fig. S9). Despite these changes in the 
Nordic Seas, the salinity in the eastern Eurasian 
Basin halocline and Atlantic water increased 
(Fig. 5D), driven by salinification in the up- 
stream northern Barents Sea due to lack of 
meltwater input from seasonal ice melt (47). 
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Fig. 5. Contrasting trends of ocean temperature and salinity in the region 
spanning from the northern North Atlantic to the Arctic Ocean during the 
positive (2007 to 2021) and negative (1992 to 2006) phases of the AD. 

(A to R) Annual time series of water temperature and salinity from repeated 
observations from the Atlantic water layer (for depth ranges, see supplementary 
materials; blue lines) and halocline (150 m, black lines) (see details in materials 


The cooling trend in 2007 to 2021 is reduced 
along stream the Atlantic water from the 
northern North Atlantic into the Barents Sea 
and Fram Strait (-0.55°C in the North Atlantic/ 
Rockall, —0.36°C at Sviney, and insignificant 
farther north; Fig. 5). These changes in the 
Nordic Seas are supported by the reduced 
oceanic heat loss over the past decades (42). 
Thus, the traditional paradigm of Atlantic 
water variability associated with either cold 
and fresh or warm and saline Atlantic water 
has changed (Fig. 5, E to N). Moreover, our 
analysis confirms a northward amplification 
of oceanic warming (7), with the origin in the 
Nordic Seas, which makes this a source region 
of atlantification and Arctic amplification. 

The observed temperature and salinity trends 
are also consistent with the switchgear-driven 
stronger flow through the Barents Sea Opening 
compared with Fram Strait flow after 2006/ 
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South Cape (SC), and North 


2007, with temperatures at the Barents Sea 
Opening and northeastern Barents Sea show- 
ing little or no decrease (Fig. 5, G and K), where- 
as the Fram Strait temperatures fell rapidly (Fig. 
5, Eand I). Since the mid-2000s, the along-track 
heat loss of the Fram Strait inflow has increased 
(43), whereas the heat loss of the Barents Sea 
throughflow has decreased or remained sta- 
ble (43, 44), which thereby implies a stronger 
warming of the Atlantic water entering the 
Arctic from the Barents Sea as compared with 
Fram Strait. Thus, a combination of the switch- 
gear mechanism and regional changes in heat 
loss make the Barents Sea a key contributor to 
the Arctic Ocean atlantification. 


Discussion 


This research identifies the mechanisms driv- 
ing atlantification and paints a broad and 


comprehensive picture of changes in the north- 
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and methods). Observations with substantial gaps were complemented by ORASS 
reanalysis time series [gray lines; for the eastern Eurasian Basin (EEB), we plotted 
ORASS temperature, adding 1°C]. Ninety-five percent statistically significant trends are 
shown by solid red lines and black font; otherwise, dashed red lines and gray font 

are used. The Beaufort Gyre (BG), Barents Sea Opening (BSO), Fram Strait (FS), * 
Eastern (NE) Barents Sea are indicated on the map. 


ern high-latitude climate system. Switchgear is - 
one of these mechanisms resulting from alter- 
nating AD atmospheric regimes. We discovered 
increased Atlantic water inflows throughout 
the Barents Sea and reduced inflow across Fram 
Strait, which resulted in a stronger warming of 
the Barents Sea as compared with Fram Strait 
during 2007 to 2021. This regime was also as- 
sociated with the amplified Beaufort Gyre, 
stronger boundary currents at the Siberian 
slope, and a shifted Transpolar Drift from the 
Amerasian Basin toward the Lomonosov Ridge. 
One of the most notable changes associated 
with AD variability is the 2007 to 2021 hiatus 
of Arctic summer sea-ice loss, which, we ar- 
gue, is a response to enhanced redistribution 
of freshwater into the Amerasian Basin caused 
by anomalous winds and increased stratifica- 
tion that suppress oceanic heat fluxes. This 
process is regionally limited to the Amerasian 
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Basin, where the sea-ice area has actually in- 
creased since 2007 (Fig. 1, D and E). Thus, 
although variations in atmospheric forcing 
may affect the ice-loss slowdown since 2007 
(45), the shutting down of oceanic heat fluxes 
by increasing Amerasian Basin stratification 
(Fig. 1B, inset) may help drive the ice-loss 
hiatus. To validate this hypothesis, we used the 
winter survival of the near-surface tempera- 
ture maximum created by the summer trap- 
ping of solar radiation below the surface mixed 
layer (10- to 30-m depth) (46). By using an ex- 
tensive 2007 to 2020 archive of Ice-Tethered 
Profiler observations, we found that 65% of 
vertical temperature profiles showed the pres- 
ence of the near-surface temperature maximum 
(and therefore negligible upper Amerasian 
Basin ventilation) during October to March (fig. 
S14; see materials and methods for details)—a 
convincing argument that enhanced sea-ice 
winter growth owing to reduced ocean heat 
fluxes contributed to the observed 2007 to 2021 
ice-loss hiatus. Sea-ice dynamics should not be 
disregarded, however. For example, despite in- 
creased ice-area export (Fig. 2D), decreased 
ice thickness in the Fram Strait (10) may cause 
a decline in ice volume export, which would 
offset the decline in net ice production in high- 
latitude regions (47). In contrast to the Amer- 
asian Basin, ventilation of the upper Eurasian 
Basin in the 2010s is well documented (5, 9, 36), 
and sea-ice loss in this basin continued through 
the 2010s (Fig. 1, D and E). 

There are numerous ongoing and potential 
ecological consequences of the observed physical 
changes. For example, the summer-integrated 
normalized difference vegetation index (TI- 
NDVI), a remotely sensed proxy for Arctic vege- 
tation productivity, shows markedly different 
behavior during AD+ and AD-. The TI-NDVI 
trends at all longitudes were primarily posi- 
tive in 1992 to 2006, which suggests that the 
vegetation gained biomass and photosynthe- 
tic productivity increased; the vegetation was 
“greening” (Fig. 1, C and F). In 2007 to 2021, 
negative trends dominated the area between 
210°E to 300°E (corresponding to the Amer- 
asian Basin sector where sea ice increased 
during 2007 to 2021), which suggests that the 
vegetation lost biomass and was possibly less 
vigorous, often called “browning” (Fig. 1, D and 
F). Thus, sea-ice variability can influence Arctic 
vegetation productivity on numerous time- 
scales, which is consistent with a correlation 
between the TI-NDVI and spring sea-ice var- 
iations (48). 

Furthermore, the import of subarctic waters 
has profound impacts on Arctic marine life 
(49), and both the Fram Strait and the Barents 
Sea Opening branches are potential pathways 
of subarctic-boreal organisms into the eastern 
Eurasian Basin (50). Our results suggest that 
organisms drifting in the upper 50 m of the 
Fram Strait branch had a fundamentally dif- 
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ferent fate if entering during AD- as compared 
with AD+ (Fig. 4, D and E). During AD-, most 
organisms entered the western Eurasian Basin 
via the Transpolar Drift, whereas during AD+, 
they were kept at the shelf break and trans- 
ported into the eastern Eurasian Basin. In ad- 
dition, during AD+, more organisms entered 
the eastern Eurasian Basin from the Barents 
Sea (Fig. 4E). An increased influence of the 
Barents Sea may cause the eastern Eurasian 
Basin to be more productive and provide more 
suitable living conditions for subarctic-boreal 
species than its western part (57), which is con- 
sistent with recent observations (50). Improved 
knowledge of asymmetric conditions in the 
pelagic ecosystems of the western and eastern 
Eurasian Basin is imperative to properly un- 
derstand and manage the central Arctic Ocean 
fisheries agreement established in 2021. 

In addition, we note that recent atmospheric 
changes as indicated with a shifting AD phase 
have been important drivers of the regional 
patterns of sea-ice and oceanic responses. There 
are, however, indications that the Arctic system 
may be entering another regime (see wavelets 
in figs. S3, $7, and S8), with potential conse- 
quences for the state of the physical, chemical, 
and biological components. The transition may 
be abrupt, similar to the rapid changes in 2007. 
The trajectory of the Arctic climate system 
into the future is further complicated by the 
existence of large-amplitude, multidecadal varia- 
bility (fig. S15). Thus, accurate future projec- 
tions require a comprehensive understanding of 
complex air-ice-ocean interactions and associ- 
ated feedback mechanisms on broad spatio- 
temporal scales through advancement of the 
observing system and modeling capabilities. 
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HUMAN EVOLUTION 


Genomic inference of a severe human bottleneck 
during the Early to Middle Pleistocene transition 


Wangjie Hu"*++, Ziqian Hao*+, Pengyuan Du?, Fabio Di Vincenzo‘, Giorgio Manzi°, Jialong Cui, 


Yun-Xin Fu’, Yi-Hsuan Pan2*, Haipeng Li?®* 


Population size history is essential for studying human evolution. However, ancient population size history 
during the Pleistocene is notoriously difficult to unravel. In this study, we developed a fast infinitesimal time 
coalescent process (FitCoal) to circumvent this difficulty and calculated the composite likelihood for present- 
day human genomic sequences of 3154 individuals. Results showed that human ancestors went through a 
severe population bottleneck with about 1280 breeding individuals between around 930,000 and 813,000 years 
ago. The bottleneck lasted for about 117,000 years and brought human ancestors close to extinction. This 
bottleneck is congruent with a substantial chronological gap in the available African and Eurasian fossil record. 
Our results provide new insights into our ancestry and suggest a coincident speciation event. 


Ithough the lineage of humans is esti- 

mated to have separated from that of 

chimpanzees and bonobos more than 

6 million years ago, anatomically mod- 

ern humans (Homo sapiens) are estimated 
to have originated around 300 thousand to 
200 thousand years before the present (kyr BP) 
in Africa (7-3). On the basis of present-day hu- 
man genomic sequences, the recent population 
size histories (i.e., the dynamics of population 
size since the emergence of modern humans) 
have been intensively studied, revealing the 
worldwide spread of our ancestors (4-8). How- 
ever, ancient population size history of the genus 
Homo during the Pleistocene is still poorly 
known, although it is essential for understand- 
ing the origin of the human lineage. It is likely 
to be very difficult or impossible to obtain an- 
cient DNA from African Homo samples dated 
before the emergence of H. sapiens. It would 
be particularly notable if present-day human 
genomic sequences could be used to robustly 
infer both the recent and ancient population 
size histories of humankind. Thus, a new ap- 
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proach is needed to improve the inference ac- 
curacy of population size history. 

Population size changes that occurred hun- 
dreds of thousands of years ago affected the 
rates of coalescence and thus have left their 
signatures in the site frequency spectrum (SFS) 
of genomic sequences. The SFS is the distri- 
bution of allele frequencies in the sequences, 
randomly collected from the present-day hu- 
man population. Each SFS category contains 
a certain number of mutations of the same 
size. Because SFS is crucial for demographic 
inference (6, 8-12) and construction of key 
summary statistics (13), many efforts have been 
devoted to deriving its analytical formulas (14-17). 


a Sion ened of time 


backward in time 


a long period of time. 
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However, these formulae may not aver ree 
required accuracy because of propagation '_..—— 
accumulation of numerical errors resulting from 
their dependence on the joint probability den- 
sity function of coalescent times (16-18). 

In this study, to circumvent this numerical 
problem, we developed the fast infinitesimal 
time coalescent process (FitCoal) (Fig. 1) that 
analytically derives expected branch length 
for each SFS category under arbitrary demo- 
graphic models. FitCoal does not need phased 
haplotype data and prior information on de- 
mography. The effects of sequencing errors or 
hitchhiking due to positive selection can be 
circumvented, largely by focusing on a subset 
of SFS that are less influenced by those factors. 
We used FitCoal to analyze a large number of 
present-day human genomic sequences from 10 
African and 40 non-African populations. Results 
showed that our ancestors experienced a sev- 
ere population bottleneck between about 930 
and 813 kyr BP, most likely because of climatic 
changes. The average number of breeding indi- 
viduals was only about 1280 during the bottle- 
neck period. Our findings indicate that the 
severe bottleneck brought the ancestral human 
population close to extinction and completely ‘ 
reshaped present-day human genetic diversity. 


Results 
FitCoal : 


We developed FitCoal to determine the expected 
branch lengths for an SFS (Fig. 1). During 
FitCoal analysis of a sample, the time period 
in which the most recent common ancestor 
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Fig. 1. Illustration of FitCoal. (Left) The backward process in which four lineages (represented by the 
four solid black circles at the bottom) coalesce into one (represented by the single solid black circle at 
the top) after passing through millions of infinitesimal time intervals (At). The area highlighted in blue shows the 
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backward transformation process of different coalescent states with tiny probability changes in an infinitesimal 
time interval. Thick arrows indicate high transformation probabilities, and thin arrows indicate low transformation 
probabilities. The blue and purple arrows correlate to the two events in the middle pane represented by 
blue- and purple-colored lines. Each state is indicated with a box, in which one circle indicates one lineage. 
The boxes with solid black circles represent the states with the probability of 1. The boxes with empty 
circles represent the states with the probability of 0. The probabilities between O and 1 are represented 

by gray circles. (Middle) Hypothetical coalescent trees with branches of different states, indicating the 
number of lineages. Blue branches represent a transformation from four to three lineages. Purple branches 
indicate that no coalescent event occurred. (Right) The size of a theoretical population over time. The 
width of shadowed area denoted as N(t) indicates the effective population size (i.e., the number of breeding 
individuals) at time t. It is assumed that the effective population size remains unchanged within a At. 
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histories; thin red lines represent 2.5 and 97.5 percentiles of FitCoal-inferred population 
size histories. Yellow, green, and blue lines indicate results obtained with PSMC, Stairway 


SMC++ with simulated samples. (A) Constant size model. (B) Instantaneous 


increase model. (C) Bottleneck model. (D) Exponential growth | model. (E) Exponential 
growth II model. (F) Exponential growth Ill model. In all panels, thin black lines indicate 
the true models. Thick red lines indicate the medians of FitCoal-inferred population size 


originated was partitioned into as many time 
intervals as needed, such that each time inter- 
val (At) was very small (e.g., 1 month or 1 year). 
During each time interval, the population size 
was assumed to be constant. The probabilities 
of all coalescent states (i.e., all possible ances- 
tral lineages) were calculated backward in 
time. For each state, the branch length during 
a time interval was calculated by multiplying 
its probability with population size and then 
transformed to determine the expected branch 
lengths. Because the expected branch length 
of an SFS category during a time interval was 
precalculated, FitCoal could be very fast. 


FitCoal demographic inference 


After the expected branch lengths were de- 
termined, the composite likelihood of an SFS 
(6, 9, 19) was calculated. FitCoal is effective 
for a wide range of sample sizes in the calcu- 
lation of the composite likelihood of a given 
SFS and is much more accurate than simu- 
lation approaches (fig. S1). When inferring 
population size history, the likelihood was 
maximized in a wide range of demographic 
scenarios. Moreover, both instantaneous and 
long-term exponential population changes were 


considered. Similar to previous studies (6, 19), 
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Plot, and SMC++, respectivel 


simulated sequences, and L 


the likelihood of the SFS was first maximized 
with the constant size model, followed by re- 
peated maximization of the likelihood with 
increased number of inference time intervals 
until the best model was found. 


Demographic inference from simulated data 


The accuracy of FitCoal demographic infer- 
ence was evaluated by simulation and com- 
parison of results with those of PSMC (pairwise 
sequentially Markovian coalescent), Stairway 
Plot, and SMC++ methods (6-8) (Fig. 2). To 
ensure fair comparisons, we tested six demo- 
graphic models by simulating 200 independent 
datasets for each model, as described previ- 
ously (6), with the assumption that a generation 
time is 24 years (6, 20) and that the mutation 
rate is 1.2 x 10° per site per generation (6, 27). 

Results showed that the medians of FitCoal- 
inferred population size histories were almost 
identical to the true models, and the 95% con- 
fidence intervals of FitCoal inference were nar- 
rower than those of PSMC, Stairway Plot, and 
SMC++ (Fig. 2). FitCoal inference accuracy 
could be further improved by increasing sam- 
ple size and lengths of sequences (fig. S2). The 
proportion of the most recent changes in popu- 


ly. The mutation rate is assumed to be 1.2 x 10-° per 


base per generation, and a generation time is assumed to be 24 years. n is the number of 


is the length in Mb of each simulated sequence. 


that FitCoal could distinguish between instan- 
taneous and exponential changes (table S1). 
Overall, our results confirmed that SFSs could 
be used to estimate population size histories (22). 

It has been suggested that a population size 
history could be inferred by using a subset of 
SFS or a collapsed SFS (6, 79); the latter is an 
SFS with high frequency mutations combined 


that the FitCoal could still accurately determine 
a population size history even when a portion 
(10 to 90%) of an SFS was truncated (i.e., ex- 
cluded for analysis) (figs. S3 to S5), thus reduc- 
ing the impact of confounding factors, such as 
hitchhiking effect due to positive selection (fig. 
S6) or sequencing errors on FitCoal analyses. 


Demographic inference of African populations 


To infer population size histories of African 
populations, seven African populations in the 
1000 Genomes Project (JOOOGP) (23) and three 
African populations in the Human Genome 
Diversity Project-Centre d’Etude du Polymor- 
phisme Humain (HGDP-CEPH) panel (24) were 
analyzed by the FitCoal (tables S2 to S4). Only 
autosomal noncoding regions were used to 
partially avoid the effect of purifying selection. 


lation size inferred from the six models showed 
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Fig. 3. Histories of human populations in 1OOOGP and HGPD-CEPH genomic 
datasets inferred by FitCoal, SMC++, Stairway Plot, PSMC, and Relate. The 
mutation rate is assumed to be 1.2 x 10° per base per generation, and a generation 
time is assumed to be 24 years. (A) Inferred population size histories of 26 
populations in LOOOGP. (B) Linear-scaled estimation of sizes over time of populations 
in LOOOGP during the severe bottleneck period. (C) Inferred population size histories 
of 24 populations in the HGPD-CEPH panel. (D) Linear-scaled estimation of sizes 
over time of populations in the HGPD-CEPH panel during the severe bottleneck period. 
(E and F) Comparison of population size histories of African YRI and Yoruba 


populations inferred by FitCoal, SMC++, Stairway Plot, PSMC, and Relate. Only 
the population size histories up to 200 kyr BP were analyzed by Stairway 

Plot. In (A) to (D), colored lines indicate the following: red, African populations; 
brown, Middle East populations; yellow, European populations; blue, East Asian 
populations; green, Central or South Asian populations; and gray, American 
populations. Black dashed circles with arrow heads represent the discrepancy in 
the population size between African agriculturalist and non-African populations. 
In (E) and (F), red, blue, yellow, green, and gray lines indicate results obtained 
with FitCoal, SMC++, PSMC, Stairway Plot, and Relate, respectively. 


selection (25), high-frequency mutations were 
excluded from the analysis. Results showed that 
all 10 African populations went through a sev- 
ere bottleneck (Fig. 3 and figs. S7 and S8). The 
bottleneck was estimated to persist for 117 kyr, 
from 930 + 23.52 (SEM) (range, 854 to 1042) 
to 813 + 11.02 (SEM) (range, 772 to 864) kyr BP. 
The average effective population size (i.e., the 
number of breeding individuals) (26) during 
the bottleneck period was determined to be 
1280 + 131 (SEM) (range, 770 to 2030), which 
was only 1.3% of its ancestral size (98,130 + 
8720; range, 58,600 to 135,000). To evaluate the 
impact of the bottleneck on current human 
genetic diversity, we analyzed the expected 
pairwise nucleotide diversity. Results showed 
that 65.85% of current human genetic diver- 
sity was lost because of the bottleneck. 


Demographic inference of non-African populations 


The severe bottleneck was not directly detected 
in all 19 non-African populations in 1000GP and 
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the 21 non-African populations in the HGDP- 
CEPH panel (Fig. 3, A to D; figs. S7 and S8; and 
tables S2, S5, and S6). The average ancestral 
population sizes of the populations in the two 
datasets were 20,260 (range, 18,850 to 22,220) 
and 20,030 (range, 19,060 to 21,850), respec- 
tively, similar to those determined in previous 
studies (7, 8, 24). The estimated population 
size started to decline around 368 (range, 175 to 
756) and 367 (range, 167 to 628) kyr BP, respec- 
tively, which is consistent with previous find- 
ings that African and non-African divergence 
occurred much earlier than the out-of-Africa 
dispersal (7, 8, 23, 24). The inferred out-of- 
Africa dispersal and the recent population size 
expansion and reduction are consistent with 
those of previous studies (5-8, 23, 24). 


Severe bottleneck during the Early to Middle 
Pleistocene transition 


The ancient severe bottleneck was directly de- 
tected in each of the 10 African populations 
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but in none of the 40 non-African populations. 
To investigate this discrepancy, we performed 


simulations with three demographic models, - 


designated bottlenecks I, I, and III (Fig. 4, A 
to C, and figs. S9 and S10). Bottleneck I sim- 
ulated the population size history of African 
agriculturalist populations with the ancient 
severe bottleneck, and bottlenecks IT and III 
simulated that of non-African populations with- 
out and with the ancient severe bottleneck, 
respectively. Both bottlenecks I and II were 
inferred precisely in all simulated data sets 
(tables S7 to S9). However, no ancient severe 
bottleneck was detected in bottleneck II sim- 
ulations, which indicates that the out-of-Africa 
dispersal hinders the chance of discovering the 
ancient severe bottleneck. Furthermore, the 
ancient severe bottleneck was found to cause a 
discrepancy in the estimation of the population 
size between the bottleneck III model and the 
inferred population size history after the bottle- 
neck was relieved, which suggests a hidden effect 
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Fig. 4. Verification of the severe bottleneck. (A) Bottleneck | model, 


mimicking the true population size history of African agriculturalist populations. 


(B) Bottleneck I! model, mimicking the inferred population size history of non- 
African populations. (C) Bottleneck III model, mimicking the true population 
size history of non-African populations. The gap between actual and FitCoal 


estimated population size is indicated by the black dashed circle and arrowhead. 


(D) Bottleneck IV model, mimicking a population with an exponential reduction 
in size 1.5 million years ago. (E) Bottleneck V model, mimicking a population 
with a moderate bottleneck. (F) Bottleneck VI model, mimicking a population 


with a weak bottleneck. Black lines represent the true models. Thick red lines 
represent the medians of FitCoal estimated population sizes over time; thin 

red lines represent 2.5 and 97.5 percentiles of FitCoal estimated population 
sizes over time. Blue, green, yellow, and gray lines represent the medians of 
10 runs each of SMC++, Stairway Plot, PSMC, and Relate. The mutation rate is 
assumed to be 1.2 x 10°° per base per generation, and a generation time is 
assumed to be 24 years. Numbers of simulated sequences are 188 in bottlenecks 
I, IV, V, and VI and 194 in bottlenecks II and Ill. The lengths of simulated 
sequences are 800 Mb each. 


of the ancient severe bottleneck on non-African 
populations (Fig. 4C and figs. S9C and S10C). 
After the bottleneck was relieved, the average 
population size of non-African populations in 
1000GP was 20,260, and that of those in HGDP- 
CEHP was 20,030. For African agriculturalist 
populations, the average population size in 
1000GP was 27,080, and that of those in HGDP- 
CEHP was 27,440. This population size difference 
of 7020 (Fig. 3, A and C) is likely due to the 
hidden effect of the ancient severe bottleneck 
on non-African populations. Because the out-of- 
Africa dispersal existed in non-African popu- 
lations but not in African populations, African 
populations had more lineages remaining to 
be traced back to the ancient severe bottle- 
neck (fig. S11). In the analysis of the African YRI 
(Yoruba in Ibadan, Nigeria) population, the 
minimum sample size was three individuals 
for detection of the severe bottleneck (fig. S12). 
Because the signal for the existence of the 
severe bottleneck was too weak to be detected 
in non-African populations by FitCoal, we per- 
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formed an extended FitCoal analysis. To eli- 
minate noise effects resulting from problems 
such as sequencing or overfitting errors on the 
inference of population size history, we used 
the FitCoal-inferred recent population size his- 
tory as a starting point for size estimation of 
an ancient population. With this modifica- 
tion, all 19 non-African populations in 1000GP 
were found to have gone through the severe 
bottleneck with approximately 1450 individ- 
uals between 921 and 785 kyr BP (fig. S13 and 
table S10). This result is consistent with that 
obtained with African populations. 

To further examine the severe bottleneck, we 
simulated a slow population reduction start- 
ing 1.5 million years ago (Fig. 4D). The FitCoal- 
inferred population size histories were different 
from those observed in 1000GP and HDGP- 
CEHP populations, which supports the hypoth- 
esis that a sudden size reduction occurred at 
the beginning of the bottleneck. Results of sim- 
ulations were similar to that of the observed 
cases (Fig. 3, E and F) in that they also showed 
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that PSMC, SMC++, and Relate methods (7, 8, 27) 
underestimated the severity of the ancient bot- 
tleneck (Fig. 4 and figs. S14 to $17). More- 


over, the inferred population declines shown - 


in Fig. 4A were more severe than those inferred 
from the real data because the simulated data 
were generated under the assumption of a neu- 
trally evolved single population and a homoge- 
neous recombination rate, whereas humans 
evolved with subpopulations and a heterogeneous 
recombination rate (28, 29). FitCoal was not 
found to overestimate such severity (Fig. 4) or 
to falsely detect a bottleneck when examining 
the effects of continuous and pulsed introgres- 
sions among existing and ghost populations 
(figs. S18 to $21). Therefore, the discovery of 
the ancient severe bottleneck was not due to 
overfitting the data in FitCoal analyses. 


Discussion 


In this study, we developed FitCoal, a model- 
flexible method for demographic inference. 
One key feature of FitCoal is that the expected 
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Fig. 5. Schematic diagram of Time 
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are presented. The width of the boxes l< 
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size (i.e., the number of breeding 
individuals) with naturally occurred 
fluctuations. The occurrence time of 
the out-of-Africa dispersal and the 
divergence between African and 
non-African populations are indicated. 
The gray-shaded time duration 
indicates the Early to Middle Pleis- 
tocene transition between 1250 and 
700 kyr BP. The red arrow indicates 
the peak of glaciation during the 
transition (i.e., the 0.9 Ma event). 
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inferred in this study is highlighted. The gap in the available African hominin fossil record and an indicative 
chronology for H. erectus, the LCA, and H. sapiens are shown. The estimated time period in which two 
ancestral chromosomes (chromosome, Chr.) fused to become one is also shown on the right. 


branch lengths can be accurately determined 
for an SFS under an arbitrary demographic 
model, which allows precise calculation of the 
likelihood. Analyses by FitCoal are, in most 
cases, less time consuming than those by other 
methods such as PSMC, SMC++, Stairway Plot, 
and Relate (28/32 = 87.5%) (tables S11 and S12). 
By discarding rare and high-frequency muta- 
tions, FitCoal can avoid the effects of sequencing 
errors or hitchhiking due to positive selection 
without losing its inference accuracy. Because 
both instantaneous and exponential changes 
are allowed within each inference time inter- 
val, FitCoal can reveal the dynamic of population 
size precisely. Because coalescent events be- 
come rare when tracing backward in time, the 
length of inference time interval is usually set 
to increase progressively (6-8). Although this 
strategy can capture recent population size 
changes, it may miss ancient ones. Therefore, 
FitCoal inference time intervals are allowed to 
vary during demographic inference, and FitCoal 
can make a fast and accurate inference of recent 
and ancient population size histories. 

The most important discovery with FitCoal 
is that human ancestors went through a se- 
vere bottleneck in the late Lower Pleistocene 
(Fig. 5). This ancient severe bottleneck was 
directly found in all 10 African populations, 
but only a weak signal of the existence of such 
was detected in all 40 non-African popula- 
tions. This observation is consistent with the 
coalescent theory and the occurrence of the 
out-of-Africa dispersal. Results of our large- 
scale simulations demonstrated that FitCoal 
did not falsely infer the bottleneck because of 
positive selections (figs. S6 and $22) or pop- 
ulation structure (fig. S18 to S21). Because we 
observed no overfitting cases and results ob- 
tained by examining different sets of genomic 
regions (Fig. 3, A and C, and fig. S23) were sim- 
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ilar, the existence of the ancient severe bottle- 
neck was ascertained. 

Our results indicate that the ancient severe 
bottleneck lasted for approximately 117 kyr 
(Fig. 5), and that about 98.7% of human an- 
cestors were lost at the beginning of the bot- 
tleneck, thus threatening our ancestors with 
extinction. The estimated effective population 
size during the bottleneck period was only 
1280 breeding individuals, which was compa- 
rable to the effective population sizes of other 
endangered mammals (30, 37). This size (1280) 
might have been overestimated because of hid- 
den population structure (32). Naturally occur- 
ring population size fluctuations might have 
further increased the extinction risk for our 
ancestors during the bottleneck. The bottleneck 
could also have increased the inbreeding level 
of our ancestors, thus contributing to the 65.85% 
loss in present-day human genetic diversity. 

The ancient population size reduction that 
occurred around 930 kyr BP was likely driven 
by climatic changes during the Early to Mid- 
dle Pleistocene transition (33, 34). During this 
transition period known as the “0.9 Ma event” 
(Ma, million years ago), glaciations were changed 
from predominantly short-term to long-term 
events with more extreme thermic intensity, 
especially at the peak of glaciation. This event 
resulted in a decrease in marine surface tem- 
perature to the lowest that occurred during the 
entire transition period (33), with an inferred 
long period of drought and extensive wildlife 
turnover in Africa and Eurasia (35). 

The existence of the ancient severe bottle- 
neck could explain the extreme scarcity of the 
available hominin fossil record in Africa and 
Eurasia between 950 and 650 kyr BP (Fig. 5 and 
fig. S24). In Africa, only a few fossil specimens 
dated in this time period have been found, in- 
cluding the cranial fragments from Gombore 
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in Ethiopia and the fossil samples from Tighenif 
in Algeria (36, 37). Although the taxonomic sta- 
tuses of these fossils are still not clear, they have 
features resembling those of later fossils at- 
tributed to Homo heidelbergensis. They are dif- 
ferent from the coeval Homo antecessor from a 
paleoanthropological site in Spain (Atapuerca, 
Gran Dolina), and some scholars considered 
H. antecessor as a possible alternative for the last 
common ancestor (LCA) (38). During the same 
chronological interval, the East Asian fossil 
record contains specimens identified as Homo 
erectus (39). It does not appear that East Asian 
H. erectus is connected to the ancient severe 
bottleneck because it is unlikely to have contrib- 
uted to the lineage leading to modern humans 
(38). In addition, coincident with this bottleneck, 
two ancestral chromosomes are believed to have 
fused to form chromosome 2 in humans around 
900 to 740 kyr BP (40, 41). Therefore, the ancient _ 
severe bottleneck possibly marks a speciation 
event leading to the emergence of the LCA 
shared by Denisovans, Neanderthals, and mod- 
ern humans, whose divergence has been dated 
to about 765 to 550 kyr BP (38, 42, 43). 

A rapid population recovery was detected in 
all 10 African populations with a 20-fold in- 
crease in size around 813 kyr BP. Control of 
fire could be part of the explanation for this 
population expansion, which is shown by the 
early archaeological evidence found in Israel 
dated about 790 kyr BP (44). Other factors, such 
as climatic changes (33, 34), might also be a 
driving force for this rapid population recovery. 

The ancient severe bottleneck was not de- 
tected in previous SFS-based analyses (6, 10, 12, 14). 
This failure might be due to the use of prede- 
fined demographic models. In this study, we 
found that the likelihood must be accurately 
calculated to detect the severe bottleneck (fig. 
S1). The use of other methods such as Stairway 
Plot—which may not have sufficient resolu- 
tion power for estimation of ancient popula- 
tion size history—is another possible reason for 
the failure (6). 

Our study revealed that an extremely small 
human population lasted for about 117 kyr 
around 930 to 813 kyr BP. Many questions 
remain unanswered, such as where these indi- 
viduals lived, how they overcame the cata- 
strophic climate changes, and how the ancient 
population remained so small for so long. Fur- 
ther studies are warranted to investigate these 
matters to obtain a more detailed picture of 
human evolution during the Early to Middle 
Pleistocene transition. 
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NATURAL HAZARDS 


The complex dynamics of the 2023 Kahramanmaras, 
Turkey, M,, 7.8-7.7 earthquake doublet 


Zhe Jia’*, Zeyu Jin’, Mathilde Marchandon?, Thomas Ulrich”, Alice-Agnes Gabriel?, Wenyuan Fan’, 
Peter Shearer’, Xiaoyu Zou’, John Rekoske’, Fatih Bulut?, Asli Garagon®, Yuri Fialko! 


The destructive 2023 moment magnitude (M,) 7.8-7.7 earthquake doublet ruptured multiple segments of the 
East Anatolian Fault system in Turkey. We integrated multiscale seismic and space-geodetic observations 
with multifault kinematic inversions and dynamic rupture modeling to unravel the events’ complex rupture 
history and stress-mediated fault interactions. Our analysis reveals three subshear slip episodes during the 
initial My 7.8 earthquake with a delayed rupture initiation to the southwest. The M,, 7.7 event occurred 9 hours 
later with a larger slip and supershear rupture on its western branch. Mechanically consistent dynamic models 
accounting for fault interactions can explain the unexpected rupture paths and require a heterogeneous 
background stress. Our results highlight the importance of combining near- and far-field observations with 
data-driven and physics-based models for seismic hazard assessment. 


he moment magnitude (M,,) 7.8 and 7.7 
Kahramanmaras earthquakes in Turkey 

on 6 February 2023 caused enormous de- 
struction and tens of thousands of ca- 
sualties from collapsed structures and 
together were one of the deadliest natural 
disasters for Turkey and Syria over the past 
millennium (7). The Kahramanmaras sequence 
is the first great earthquake doublet with a 
combined moment magnitude of 8 recorded 
in acontinental strike-slip fault system. Unlike 
regular aftershocks that are more than one 
order of magnitude smaller than their main- 
shock, doublet events pose a greater hazard 
because they can cause more severe damage 
by striking already weakened buildings and 
structures. We show that the Kahramanmaras 
earthquake doublet involved a remarkable se- 
quence of subevents that occurred with vary- 
ing rupture velocities, geometries, and time 
delays on branched fault segments, which chal- 
lenge our understanding of earthquake interac- 
tions and the dynamics of rupture propagation. 
Seismologists commonly approximate earth- 
quakes as point sources or as slip along a sin- 
gle fault with fixed rupture velocity. However, 
large earthquakes often rupture multiple fault 
segments within a complex network (2-6). 
Occasionally, events of a comparable magni- 
tude occur within minutes to hours of the ini- 
tial event, resulting in earthquake doublets 
(7-9). Branching faults may further compli- 
cate rupture dynamics (J0-12). Whether rup- 
ture stops or continues propagating at fault 
junctions can determine earthquakes’ even- 
tual size and destructive potential (73). When 
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applied to complex ruptures on multiple faults, 
conventional earthquake source imaging often 
involves oversimplified assumptions, yielding 
stark differences in source models and their 
interpretations (14, 15). Initial studies of the 
Kahramanmaras earthquakes presented a wide 
range of earthquake models and interpreta- 
tions (16-21), likely from focusing on particular 
datasets and aspects of the rupture process. 
These differences motivate unified and self- 
consistent approaches that integrate diverse 
datasets with state-of-the-art rupture mod- 
els to advance our understanding of the earth- 
quake dynamics. 

We performed a comprehensive investiga- 
tion of the M,, 7.8-7.7 Kahramanmaras doublet 
using data-driven and physics-based analyses 
applied to near- and far-field seismic and ge- 
odetic observations. Our results reveal that 
the earthquakes followed unexpected rupture 
trajectories, which included delayed backward 
branching, statically and dynamically aided 
triggering, and a combination of subshear and 
supershear rupture episodes. These discoveries 
call for reevaluating the role of cascading fail- 
ure mechanisms when assessing the destructive 
potential of large earthquakes within complex 
fault networks. 


The geometrically complex M,, 7.8-7.7 
earthquake doublet 


On 6 February 2023, two major (MM, > 7) earth- 
quakes ruptured several previously recognized 
fault systems within 9 hours (Fig. 1). The East 
Anatolian Fault (EAF) is a mature transform 
fault accommodating up to 10 mm/year of left- 
lateral motion between the Arabian and Anato- 
lian plates (Fig. 1) (22). Several M,, ~7 earth- 
quakes occurred on the EAF historically, but 
none ruptured the entire southern section of 
the EAF (23). The estimated dimensions of the 
historic events suggest that geometric com- 
plexities such as fault bends and step-overs 
may have controlled the event sizes (23, 24). 
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The second earthquake (M,, 7.7) occurres ee 
the Savrun-Cardak Fault (SCF), extenc-5— 
~150 km along the east-west direction (Fig. 1). 
The SCF has been relatively quiescent, with 
only two moderate (M,, <6) events recorded in 
the past 100 years (25). 

We constrained the rupture geometry on 
the basis of surface traces mapped using Syn- 
thetic Aperture Radar data (26) and precisely 
relocated aftershocks (27, 28). We found that 
the Kahramanmaras doublet ruptured at least 
six major fault segments (Fig. 1). The epicenter 
of the M,, 7.8 earthquake is located on a sub- 
sidiary fault, the Nurdagi-Pazarak (Narli) Fault 
(NPF) (Fig. 1A, fault 1) (20), from which the rup- 
ture propagated to the EAF, and then ruptured 
along the EAF to both the northeast and south- 
west (Fig. 1A, faults 2 and 3), for a total length 
of about 300 km. Unlike the historical My ~7 
events, the M,, 7.8 earthquake propagated | 
across at least four possible geometric barriers, 
including fault bends and stepovers. 

The static slip distribution (Fig. 1B) obtained 
from inversions of Synthetic Aperture Radar 
(SAR) and Global Navigation Satellite System 
(GNSS) data (figs. S1 to S7) shows that the 
largest slip in the M,, 7.8 event ison the EAF at ‘ 
its junction with the NPF, near the towns of 
Kahramanmaras and Pazarcik, with a peak 
slip in excess of 8 m. Most of the coseismic 
slip is in the upper 20 km of the seismogenic ‘ 
layer (Fig. 1B). Slip at the surface is highly 
heterogeneous, which is consistent with field 
observations (/8), but on average increases 
from the southwest to the northeast ends of 
the M,, 7.8 rupture (fig. $8). The area of sub- 
stantial slip extends to the northeast from 
the junction for about 150 km to the western 
tip of the 2020 M,, 6.7 Elazig rupture (Fig. 1A) 
(29). South of the junction, the M,, 7.8 rup- 
ture extends to the southern end of the EAF. 
The average coseismic slip on the southwest ‘ 
branch of the M,, 7.8 rupture is smaller than 
the average slip on the northeast branch (Fig. 
1B and fig. $2). 

We resolved the spatiotemporal rupture 
process with a subevent inversion method by 
using both near- and far-field seismic obser- 
vations (30, 31). The My, 7.8 earthquake had 
six subevents that altogether spanned ~90 s 
(Fig. 2A). The M,, 6.8 subevent E1 that rup- 
tured the NPF was followed 18 s later by the 
largest subevent E2 (M,, 7.5) at the NPF-EAF 
intersection. The earthquake then ruptured 
northeastward along the EAF for about 130 km 
(M,, 7.5 subevent E3), as well as, after a short 
delay, backward from the NPF junction for 
about 150 km along the southwestern segment 
of the EAF, with an integrated slip equivalent 
to a M,, 7.4 earthquake (subevents E4 to E6). 
Teleseismic P wave back-projection (32) con- 
firmed the rupture process, with imaged high- 
frequency radiation peaks outlining the major 
subevents (Fig. 2A) and indicating an average 
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rupture velocity of 3 km/s. To further con- 
strain the slip history, we performed a joint 
kinematic slip inversion of the My, 7.8 earth- 
quake constrained by far- and near-field seis- 
mic and geodetic data (26, 33). Our kinematic 


inversion results agree with the static and 
subevent models (Fig. 2B). The best-fit kine- 
matic slip model images 10-s-delayed backward 
branching at the NPF-EAF intersection, toward 
the southwest (Fig. 2B), constrained by the 
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strong-motion data (fig. S17). It also indi- 
cates average rupture velocities of 3.2 km/s and 
2.8 km/s for the northeastern and southwest- 
ern branches, respectively (fig. $18). Tracking 
ground motion pulses at near-fault strong 


0 0.5 1 
Normalized slip 


Fig. 1. A multifault earthquake doublet. (A) Tectonic background and aftershock — blue beachball denote the rupture extent and focal mechanism of the 2020 M, 6.7 
seismicity of the study area near Kahramanmaras, Turkey. Red and purple stars Elazié earthquake (29). (Inset) The regional tectonics and major plate boundary 


indicate the M,, 7.8 and 7.7 earthquake epicenters according to the Turkey Disaster 


faults (solid black lines). Red outline denotes the study area. (B) Finite-fault model of 


and Emergency Management Authority (53), and red and purple beachballs indicate — the 2023 doublet derived from inversions of space geodetic (INSAR and GNSS) 


focal mechanisms from the Global Centroid Moment Tensor catalog, respectively. 


data. Fault segment numbers correspond to those shown in (A), in order of their 


Red and purple lines indicate surface ruptures identified from SAR data (26). Yellow —_ rupture time: 1, Nurdagi-Pazarcik Fault; 2 and 3; East Anatolian Fault; 4 to 6, 


dots indicate aftershocks for the period between the M,, 7.8 and 7.7 earthquakes, 


Savrun-Cardak Fault. (Inset) The along-strike averaged coseismic slip normalized by 


and black dots indicate aftershocks after the M,, 7.7 event (28). The blue line and the maximum slip amplitude, as a function of depth (49). 
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Fig. 2. Complex slip evolution of the M,, 7.8 earthquake, including delayed 
initiation of slip. (A) Subevent model from near- and far-field seismic 
observations and back-projection results, suggesting that the My 7.8 earthquake 
initiated on the NPF-1 (Fig. 1B, fault 1), then propagated bilaterally, northeast 
along the EAF-2 (Fig. 1B, fault 2) and southwest along the EAF-3 (Fig. 1B, fault 3). 
The rupture of fault 2 terminates around 50 s, whereas rupture of fault 3 
continues for an additional 30 s. (B) Rupture history within different time 


Jia et al., Science 381, 985-990 (2023) 


39° 


1 September 2023 


80} 


C Distance along EAF (km) 
~150 -100 -50 0 50 100 150 
5) 5|| @ Subevent 
3: 5:| BP 
= &!| — Finite Fault 


intervals from our kinematic slip inversion of far- and near-field seismic and 
geodetic data. We infer rupture velocities of 3.2 and 2.8 km/s for the northeast 
and southwest episodes, respectively, and a 10-s delay in the onset of the 
southwest rupture along EAF-3 with respect to the NE rupture along EAF-2. The 
slip distribution within each time interval agrees with the subevent (black circles) 
inversion. (©) Subevents, back-projection locations and times, and finite-fault 
velocities [in (B)] consistently indicate delayed initiation of slip on branch EAF-3. 
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motion stations along the southwestern seg- 
ment also yielded a rupture velocity of ~3 km/s 
(fig. S18), confirming an overall subshear na- 
ture. All our kinematic models consistently 
reveal a ~300-km-long complex bilateral multi- 
segment rupture, subshear rupture velocities, 
and delayed triggering of the southwest seg- 
ment of the M,, 7.8 event (Fig. 2C). 

The subsequent M,, 7.7 earthquake ruptured 
a 150-km-long section of the west-trending 
SCF, within 90 km of the M,, 7.8 earthquake 
hypocenter. The aftershock distribution and 
surface offsets indicate branching and abrupt 
changes in strike at both the eastern and west- 
ern ends of the M,, 7.7 rupture (Fig. 1). Geo- 
detic data and our associated static slip model 
(Fig. 1B) suggest rupture along an 80-km- 
long segment of the SCF system (Fig. 1, faults 
4 and 5), but not along the eastern end of the 
Sirgii fault that connects to the EAF. In- 
stead, the M,, 7.7 rupture diverted sharply 
onto the Dogansehir branch, which angles 
to the northeast (Fig. 1, fault 6). The M,, 7.7 
event shows a concentrated slip distribution 
with >10 m peak slip around its hypocenter, 
suggesting a substantially higher stress drop 
than that of the initial M, 7.8 earthquake, 
which spread a lower-amplitude slip over a 
larger region. 

Our analysis of the rupture history of the 
M,, 7.7 event identified four major subevents, 
lasting for about 30 s (Fig. 3A). The first three 
subevents, El to E3, all cluster near the epi- 
center and account for more than 80% of the 
total seismic moment, suggesting a compact 


bilateral rupture in the central SCF. The focal 
mechanism (strike of 237°) and location of 
the last subevent (E4; M,, 7.1) agree with the 
static slip model on the Dogansehir branch 
(Fig. 1B). All subevents of both earthquakes 
have almost pure double-couple mechanisms 
(Figs. 2A and 3A), suggesting that the strong 
non-double-couple components in the Global 
Centroid-Moment-Tensor solutions (Fig. 1A) (34) 
are due to highly variable rupture geometries. 
The overall shorter duration and smaller rup- 
ture extent of the M,, 7.7 event make back- 
projection analysis less effective for resolving 
rupture details, but our kinematic finite-slip 
inversion can still be applied. 

The kinematic finite-fault model of the ,, 
7.7 earthquake also indicates a compact slip 
distribution. In addition, it indicates a west- 
ward rupture velocity of ~4.5 km/s (Fig. 3B), 
exceeding the shear-wave speed in the crust. 
The waveforms recorded at the westward seis- 
mic stations strongly constrain this supershear 
rupture episode (Fig. 3C and fig. S19), which is 
consistent with analysis of high-rate GNSS data 
(20). By contrast, the eastward rupture likely 
propagated at a slower velocity of 2.5 km/s. 
The intriguing supershear rupture episode 
may imply locally higher prestress (35) and 
high stress drop (36) as in our dynamic rup- 
ture models. 


Dynamics, triggering, and stress interaction 
of the doublet 


Dynamic rupture modeling involves simulat- 
ing how earthquakes nucleate, propagate, and 
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Fig. 3. Asymmetric kinematics of the M,, 7.7 earthquake. (A) Three subevents 
close to the hypocenter suggest a bilateral rupture. The fourth event images the 
rupture of the Dogansehir branch (Fig. 1B, fault 6). (B) Asymmetric bilateral 
rupture velocities of the My 7.7 event. The westward rupture has an inferred 
supershear velocity of 4.5 km/s, whereas a subshear velocity is seen toward the 
east (2.5 km/s). Subevent locations are based on their seismic moment 
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arrest. Unlike purely data-driven kinematic slip 
inversions, such models predict the evolution 
of slip, seismic waves, and surface deformation 
in a physically self-consistent manner. Detailed, 
physics-based interpretations can help verify 
whether inferred rupture scenarios are me- 
chanically plausible but are computationally 
challenging and typically take years to devel- 
op [for example, (JO, 12, 13)]. 

We present data-informed dynamic rupture 
simulations of the 2023 Kahramanmaras earth- 
quakes that illuminate complex details of the 
rupture process. Our three-dimensional (3D) 
dynamic rupture models include stress changes 
computed from the slip distribution of the 
static slip model (37), large-scale variability in 
fault loading inferred from regional seismo- 
tectonics, and the relative effects of the static 
and dynamic stresses of the M,, 7.8 event on 
the faults hosting the second earthquake (fig. | 
$20) (26). The dynamic rupture models inde- 
pendently reproduce the main features of the 
kinematic models (Fig. 4 and fig. S21), pro- 
viding a physics-based validation of the in- 
ferred rupture histories. 

Our forward simulations use the complex 
fault geometries of both earthquakes informed 
from geodetic analysis (Fig. 1) to spontaneously 
replicate the moment rate release, magnitude, 
rupture velocity and delays, as well as the lack 
of instantaneous dynamic triggering of the 
M,, 7.7 event. The dynamic rupture synthetics 
produce surface displacements and slip histor- 
ies that compare well with the high-resolution 
geodetic data (fig. S22), kinematic rupture 
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centroids. The slip may not be the largest at the centroid location, specifically for 
bilateral ruptures. For example, E3 (10 to 30 s) averages slip pulses of both the 
westward supershear and the eastward subshear rupture. (C) A westward 

supershear rupture velocity (red waveforms) better explains observed waveforms 
(black) at near-fault strong motion stations to the west [triangles in (A)] than a 
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Fig. 4. 3D dynamic rupture scenarios and stress-mediated interactions of the 
M,, 7.8 and 7.7 earthquakes. (A) Snapshots of absolute slip rate evolution in the 
My 7.8 dynamic rupture scenario (movie S1). (B) Modeled rupture speeds in linked 
dynamic rupture simulations (54) of both earthquakes indicating dominantly 
subshear rupture speeds but sustained westward supershear during the My 7.7 


representations (Fig. 4 and fig. $21), and ob- 
served ground motions (Fig. 5 and figs. S23 
to S25). The modeled M,, 7.8 earthquake dy- 
namics are illustrated in Fig. 4A. The NPF-EAF 
intersection slows subshear rupture on the 
NPF that then branches with dynamically fa- 
vorable forward directivity (38) northeastward 
along the EAF. The large fault branching angle 
poses a strong dynamic barrier in backward- 
directivity (39), leading to substantially delayed 
EAF rupture toward the southwest. Continuous 
dynamic unclamping, transient shear stress- 
ing, and static stress buildup at the fault in- 
tersection due to the unilaterally propagating 
northeast rupture allowed the rupture to even- 
tually fracture the EAF bilaterally (fig. S26). 
Rupture speed remained overall subshear dur- 
ing the earthquake (Fig. 4B). 

Dynamic rupture modeling of the M,, 7.7 
earthquake features bilateral rupture with un- 
equal rupture speeds, confirming dominant 
supershear westward and subshear eastward 
propagation. Our M,, 7.8 dynamic rupture 
model predicts a highly variable pattern of 
static and dynamic stresses resolved on the 
faults that hosted the M,, 7.7 earthquake (Fig. 4, 
C and D). The hypocentral area of the M,, 7.7 
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event experienced an increase in static Coulomb 
stress of several hundred kilopascals because 
of the M,, 7.8 earthquake, resulting from both 
an increase in shear stress and a decrease in 
fault-normal compression (fig. $27). It also ex- 
perienced a much larger transient increase 
in the Coulomb stress of a few megapascals 
owing to passing seismic waves (Fig. 4C), which 
nevertheless did not result in instantaneous 
triggering. 


Discussion and conclusions 


Our analyses reveal unexpected rupture paths. 
The Kahramanmaras doublet originated as a 
moderate event on the NPF branch fault with 
a magnitude of only 6.8, yet the rupture was 
able to successfully cross the junction of the 
NPF and EAF, which would usually be con- 
sidered a geometric barrier that conditionally 
gates the rupture propagation (40, 47). As a 
result, the earthquake intensified with the 
northeastward propagation along the EAF then 
dynamically triggered backward rupture toward 
the southwest by continuously unclamping and 
stressing from the forward branch, eventually 
culminating in a M,, 7.8 event, with total seis- 
mic moment increased by a factor of 30 com- 
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scenario (fig. S21 and movie S2). (©) Peak absolute dynamic shear stress 
perturbation reaching up to 7 MPa measured in the direction of maximum initial 
traction. (Inset) Evolution of dynamic shear stress and fault strength at the M,, 7.7 
hypocenter (black star). (D) Static Coulomb failure stress changes (ACFS) 
assuming a static friction coefficient of 0.6. 


pared with the initial rupture on the NPF. In 
addition, the M,, 7.8 earthquake increased the 
Coulomb stress on the central part of the SCF, 


process highlights the additional hazard brought 
by rupture triggering across a network of faults, 


that typically do not consider such multifault- 
triggering scenarios. 

The M,, 7.8 earthquake involved backward 
fault branching, which is highly unfavorable 
from a dynamic perspective, thus commonly 
neglected in hazard studies. Several previous 
continental earthquakes—including the 1992 
Landers, the 1999 Hector Mine, and the 2002 
Denali earthquakes—have also exhibited local- 
ized backward branching (0). Existing explan- 
ations of this phenomenon include backward 
rupture jumping induced by sudden rupture 
arresting or nonuniform prestress fields caused 
by earthquake cycles (39, 42). Our dynamic 
rupture models indicate that backward branch- 
ing during the M,, 7.8 event does not necessarily 
require a complex arrangement of the receiver 
fault (42) or triggering of supershear rupture 
(43). Instead, the progressive build-up of slip on 
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which may have aided the nucleation of the 
M,, 7.7 earthquake 9 hours later. The entire ‘ 


challenging earthquake hazard assessments - 
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Fig. 5. Peak ground velocities (PGVs) plotted against Joyner-Boore distance (Rjg) for the M,, 7.8 
and M, 7.7 earthquakes. (A) The My 7.8 earthquake. (B) The M,, 7.7 earthquake. Observed PGVs from 
strong motion accelerometers are indicated with open black circles, and simulated PGVs from the dynamic 
rupture simulations are indicated with open blue squares. We bin the PGV data by Rjg and plot the medians 
for each distance bin (solid markers). The red curve indicates PGV predicted by a ground motion model 
(55), assuming an average shear wave velocity for the top 30 m of soil (VS39) of 760 m/s, with the shaded 
area denoting its uncertainty. All PGV are rotationally independent geometric mean values (GMRotD50). 
We include simulated and observed data at the same locations, respectively. 


the forward branch of the EAF continuously 
unclamps and stresses the backward branch 
of the EAF, eventually leading to a delayed and 
self-sustained branching toward the southwest, 
which is a simple yet effective mechanism. 
One of the unexpected aspects of the My, 7.7 
earthquake is that it did not rupture through 
the eastern Siirgii segment and arrive at the 
EAF, contrary to earlier suggestions (20), but 
instead deviated to the Dogansehir branch. 
The Interferometric Synthetic Aperture Radar 
(mSAR), aftershock, and seismic data clearly 
show such a deviation (figs. S1 and S14). The 
straightforward rupture path along the Siirgii 
fault was encouraged by the static stress changes 
from the M,, 7.8 event (fig. S27), unlike the sharp 
deviation to the Dogansehir fault, which was 
actually unloaded by the M,, 7.8 event (Fig. 4D 
and movie S3). Possible explanations, which may 
be tested by future geodetic and seismological 
observations, include velocity-strengthening be- 
havior of the eastern Siirgii segment or local 
stress heterogeneity, for example, because of 
past earthquakes (44). Considerable regional 
stress heterogeneity, as is required by our dy- 
namic rupture models (fig. S20), is implied by 
extremely complex rupture geometries that 
involve changes in the strike angle of up to 90° 
(Fig. 1 and fig. S27) (45). Some faults in the 
study area, including the EAF, exhibit shallow 
creep (46); however, creep has to be pervasive 
to potentially suppress an incoming dynamic 
rupture. Observations spanning all phases of 
the earthquake cycle are needed to constrain 
the velocity- and depth-dependent frictional 
properties of active faults (47, 48). Shallow 
creep might be responsible for a substantial 
reduction in the amplitude of coseismic slip in 
the top few kilometers of the upper crust (Fig. 
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1B), which is well resolved in our inverse mod- 
els (fig. S28). Subsequent observations will show 
whether this reduction can be compensated 
by shallow afterslip or constitutes a long-term 
shallow slip deficit (49), implying widespread 
off-fault yielding (47, 50). 

We also found intriguing variations in rup- 
ture velocity across segments of the EAF-SCF 
fault network. Although the M,, 7.8 event pro- 
duced extreme shaking with peak ground ac- 
celerations (PGAs) exceeding 1 g for near-fault 
stations, the observed and simulated M,, 7.7 
ground motions are similar or larger when com- 
pared at the same distance (Fig. 5 and figs. S23 
and S24), which is consistent with a potentially 
larger stress drop of the M,, 7.7 event. The 
modeled and observed M,, 7.7 event shaking 
shows less distance dependence, which may be 
due to the effects of supershear rupture. 

The western branch of the SCF experienced a 
supershear rupture episode, whereas the east- 
ern SCF branch and the EAF hosted subshear 
ruptures with considerable delays. In general, 
our modeling shows that the pre-event stress 
heterogeneities, dynamic and static redistribu- 
tion of stress, and the geometry of the faults may 
control these diverse rupture characteristics. 

The Kahramanmaras doublet ruptured mul- 
tiple faults in distinct slip episodes, likely in- 
volving complex stress-triggering processes 
across different temporal and spatial scales. 
Such processes resulted in the increased rupture 
length and seismic moment of the Turkey earth- 
quake doublet, and a substantially larger de- 
structive potential compared with the “typical” 
M,, ~’7 historical earthquakes in the region (23). 
Such a variability might be interpreted in terms 
of the supercycle model (51). By using inte- 
grated methods that combine near- and far- 
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field seismic and geodetic observations and 
investigating data-derived models and physics- 
based rupture simulations, we show that stress 
interactions and static and dynamic triggering 
worked together across a complex fault sys- 
tem, resulting in a cascade of rupture with a 
larger than usual total rupture length and mo- 
ment magnitude. Our study shows that com- 
plementary data-driven and physics-based 
analyses, which in isolation often lead to non- 
unique or even contradictory results, can jointly 
and efficiently unravel highly complex earth- 
quake dynamics based on dense near-field ob- 
servations. The unusual static and dynamic 
interactions during and between the events of 
the Kahramanmaras doublet call for reassess- 
ment of common assumptions built into seis- 
mic hazard assessments. 
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EVOLUTIONARY BIOLOGY 


Wild pedigrees inform mutation rates and historic 
abundance in baleen whales 


Marcos Suarez-Menéndez**+, Martine Bérubé"+, Fabricio Furni?, Vania E. Rivera-Leon’, 
Mads-Peter Heide-Jorgensen®, Finn Larsen‘, Richard Sears®, Christian Ramp>°, 
Britas Klemens Eriksson’, Rampal S. Etienne’, Jooke Robbins*+, Per J. Palsbgll"2*+ 


Phylogeny-based estimates suggesting a low germline mutation rate (1) in baleen whales have influenced 
research ranging from assessments of whaling impacts to evolutionary cancer biology. We estimated p: directly 
from pedigrees in four baleen whale species for both the mitochondrial control region and nuclear genome. 
The results suggest values higher than those obtained through phylogeny-based estimates and similar to 
pedigree-based values for primates and toothed whales. Applying our estimate of 1 reduces previous genetic- 
based estimates of preexploitation whale abundance by 86% and suggests that pp cannot explain low 

cancer rates in gigantic mammals. Our study shows that it is feasible to estimate p directly from pedigrees in 
natural populations, with wide-ranging implications for ecological and evolutionary research. 


daptive evolution is ultimately driven by 

the emergence of novel genotypes, which 

in turn are a result of de novo germline 

mutations and recombination. The rate 

of de novo mutations (or p, the proba- 
bility of a nucleotide substitution per site per 
generation) varies considerably among taxo- 
nomic groups. Multiple causes have been pro- 
posed to explain observed differences in u 
(J), such as selection on p itself or differences 
in physiological and cellular processes (e.g., 
generation time, metabolic rates, or DNA re- 
pair mechanisms). The germline mutation rate 
is also a central parameter in evolution and 
population genetic inference and is neces- 
sary to convert genetic estimates into intuitive 
quantities, such as years and abundance. Most 
estimates of u are inferred from the number of 
observed substitutions among homologous 
DNA sequences in phylogenies (upyy) with 
dated branching points (e.g., using fossils or 
carbon dating). In a few species, un has been 
inferred from comparisons of ancient and 
contemporary DNA sequences from samples 
across a wide temporal range (Unc), typically 
yielding higher estimates than Uppy (2-4). The 
observed discrepancies among different estima- 
tions of u may reflect either real biological dif- 
ferences or experimental issues. Both approaches 
are subject to multiple sources of error (e.g., 
incorrect mutation model, postmortem DNA 
damage, and sampling bias) and uncertainty 
(e.g., incomplete and/or poorly dated fossil 
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records and unknown mutation model pa- 
rameters), aspects that also vary among spe- 
cies groups (5-10). However, 1: can now also be 
inferred directly from the pedigrees of individ- 
uals (pgp) through inexpensive whole-genome 
sequencing (77). Direct estimation of upgp re- 
lies on far fewer assumptions and depends not 
on fossil dating or access to ancient samples 
but rather on contemporary parameters, such 
as generation time. Perhaps most importantly, 
Upgp estimates yield generational values of u 
that are directly comparable among species, 
thus providing a means to test current hypothe- 
ses about the cause(s) of differences in 1 and 
to revisit key findings and hypotheses based 
on UpHy- 

Baleen whales (Mysticeti, Cetacea) include 
the largest and longest-lived mammals and 
are at the center of ongoing questions that 
pertain to u. Their gigantic body sizes and rel- 
atively low metabolic rates (compared with 
smaller-bodied mammals) are thought to re- 
sult in less intracellular DNA damage and 
thus a lower expected 1 (72). Indeed, prior es- 
timates of pp in both the nuclear and mito- 
chondrial (mt) genome of baleen whales were 
lower than similar estimates for smaller-bodied 
mammals with similar generation times, such 
as primates (13). This observation has been 
the subject of considerable debate regarding 
the cause and consequences of the lower upyy 
(13, 14). The lower upyy observed in baleen 
whales has been proposed as one of several 
possible explanations for low cancer rates in 
baleen whales, part of a phenomenon known 
as Peto’s paradox (15, 16). Later work has 
proposed other specific mechanisms, such as 
duplication of cancer suppression genes (J4, 17), 
as more plausible explanations for lower can- 
cer rates in baleen whales and other large mam- 
mals, but the role of the lower upyy remains 
untested. 

Another important implication of u relates 
to historical whaling. Baleen whales were sub- 
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jected to overexploitation on a global S| 
and determining the magnitude of that-—- 


q 


Chec 


upde 


pletion is critical for understanding the current 
status of whale populations and the carrying 
capacity of the historic oceans. The values of 
Uppy that were used to estimate preexploita- 
tion abundance, based on current levels of ge- 
netic diversity in some whale species (8, 19), 
yielded abundance estimates that were 4 to 
10 times higher than results obtained by non- 
genetic approaches (20). The high genetic es- 
timates of preexploitation whale abundance 
imply less population recovery than expected 
and historic oceans that supported a much 
larger biomass of whales and prey. 


Nuclear mutation rate 


Estimates of upgp in the nuclear genome have 
so far been obtained from 84 vertebrate spe- 
cies, including 36 mammals (77). Most non- 
human mammal estimates of ppp have been 
based on known sire-dam-offspring trios (hence- 
forth referred to as trios) from captive pop- 
ulations. In contrast, estimates of Upgp in wild 
populations are rare and confined to isolated, 
easily sampled groups of parents and offspring 
in only five species (table S1). Bergeron and 
colleagues (72) noted that estimates of upgp in 
captive animals are likely not representative 
of wild conspecific populations, which differ 
in terms of generation times as well as the 
nature and level of natural selection. Here, we 
show that estimating Upgp in wild, difficult-to- 
sample populations is feasible and straight- 
forward. Toward this end, we identified trios 
among baleen whale samples of unknown an- 
cestries (table S2) and sequenced the genome 
of each member in multiple trios, from which 
we estimated Upgp. 

We used genetic parentage exclusion anal- 
ysis (based on microsatellite genotypes; table 
$3) with molecular sex determination to iden- 
tify trios among skin samples collected from 
free-ranging North Atlantic blue (Balaenoptera 
musculus), fin (Balaenoptera physalus), bow- 
head (Balaena mysticetus), and humpback 
(Megaptera novaeangliae) whales. Despite sam- 
pling a relatively low proportion of the over- 
all population in some species (table S2), we 
were able to identify trios. Available a priori 
pedigree information (only for some humpback 
whales) was not used in trio identification. We 
subsequently estimated upgp in baleen whales 
from the de novo germline mutations detected 
in the whole-genome sequences in eight trios 
(21 genomes in total): one each in blue, fin, 
and bowhead whales, and five in humpback 
whales. 

The complete genome was sequenced in all 
individuals to an average depth of ~32x and 
aligned against the blue whale genome (27). In 
all eight trios, the genome sequences con- 
firmed the relationships inferred from the 
microsatellite data. An initial set of de novo 
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Fig. 1. Summary of the 
estimation of pp_ep in 
the nuclear genome per 
sire-dam-offspring trio 

in humpback, blue, fin, 
and bowhead whales. 
Squares and circles 
denote males and females, 
respectively, and contain 
the sample identification 
number (top), the first 
sighting year (middle), 
and the mean genome 
sequence coverage 
(bottom). The first sighting 
was known to be the birth 
year or, where marked 
with an asterisk, the year 
first sighted before sampling. 
A question mark denotes 
individuals with no previous 
sightings. Values along 

the vertical lines are the total 
number of detected de novo 
mutations (DNMs, in bold) 
as well as the number 

of de novo mutations 
originating from the sire 
(number to the left of the 
vertical line) and dam 
(number to the right of 
the vertical line). The 
parameter upep denotes 
the estimate obtained 
from each trio, with the 
95% confidence intervals 
in parentheses. [Whale 
illustrations by Frédérique 
Lucas (relative sizes are 
not to scale)] 


mutations was identified in each offspring’s 
genome by means of a modified version of the 
bioinformatic pipeline by Bergeron et al. (22). 
In total, 240 de novo mutations were identi- 
fied in the eight offspring, among which 113 
were phased, of which 90 (80%) were of pa- 
ternal origin (tables S4 and S5). De novo mu- 
tations transmitted across two generations in 
humpback whales (Fig. 1) segregated accord- 
ing to Mendelian expectations. No de novo 
mutations were shared between half-siblings 
(Fig. 1, suggesting that most de novo muta- 
tions arose during meiosis. The majority of 
the de novo mutations comprised substitu- 
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tions from a strong to a weak base, with no 
functional impacts (fig. S6). In this study, age 
was only known for humpback whale individ- 
uals sighted earlier, either as a calf during their 
birth year (i.e., absolute age) or in a later year 
as adults (.e., minimum age). However, the 
humpback whale pgp estimates suggested a 
positive correlation between the number of 
detected paternal de novo mutations and pa- 
ternal age at conception, in agreement with 
earlier studies (17, 22-24). 

De novo mutations were callable in ~60% of 
the autosomal genome, yielding estimates of 
Upgp at 1.12 x 10~® [95% confidence interval 
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pep 


(0.64 - 1.42x10°8) 


=1.038x108 Upep =0-95x 108 32 DNs 


(0.58 - 1.33x10°8) 


Upep=1.27x108 
(0.84 - 1.70x10 8) 


iv) 


. mysticetus 


30 DNs 


Upep=1.20x108 
(0.78 - 1.61x10 8) 


(CI): 0.94 x 10~* to 1.30 x 107°] in the hump- 
back whales and 11 x 10~* (95% CI: 0.97 x 10° 
to 1.25 x 10-8) in all baleen whales combined 
(Fig. 1 and table S4). Paternal age has a large 
impact on the number of de novo mutations in 
offspring and consequently on individual trio 
estimates of Upgp (77). Humpback whale sires 
were selected in a uniform manner (Fig. 1) cen- 
tered around the generation time in humpback 
whales [21.5 years (25)]. Although ages were 
unknown for the other baleen whale species, 
the overall baleen whale and humpback whale 
estimates of Upgp were very similar to each 
other. They were also similar to estimates of 
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Fig. 2. Comparison of estimates of nuclear jipep and preexploitation humpback 
whale abundance. (A) Estimates of nuclear pep for baleen whales (this study), toothed 
whales, chimpanzees, and humans (table S6) ordered by body mass. (B) Genetic 


Upgp in humans, large primates, and toothed 
whales (11, 23, 26) (Fig. 2A and table S6), spe- 
cies with similar generation times and pater- 
nal ages. Our results therefore contradict the 
current notion that p is lower in baleen whales 
than in smaller-bodied mammals with similar 
generation times. Given the crucial role of pa- 
ternal age in terms of the number of de novo 
mutations in the offspring, we note that there 
is currently only one method to determine chro- 
nological age from tissue samples—DNA meth- 
ylation (27)—but the development costs are 
high, and samples from known-age individ- 
uals are required for calibration. However, a 
simpler and more straightforward way to ob- 
tain an estimate of upgp, which reflects the 
average in wild populations, would be to con- 
secutively add estimates of ttpep from addi- 
tional trios until stationarity of upgp is reached, 
suggesting that the estimate of upgp has 
converged on the population mean. It is worth 
noting that population generation times and 
reproductive maturity are nonstatic entities that 
change with the underlying population dynam- 
ics. Most contemporary baleen whale populations 
are recovering from historic overexploitation, 
and thus the demographic structure may still 
be skewed toward younger individuals when 
compared with stable populations. In addition, 
current generation time estimates for baleen 
whales are exclusively informed by female re- 
productive histories (25), ignoring the sires from 
which ~80% of de novo mutations were inherited. 
Thus, if anything, our estimate of upgp could be 
biased downward relative to stable populations. 

Previously reported low uppy estimates seemed 
to be in agreement with the hypothesized ef- 
fects of a reduced metabolic rate in gigantic 
mammals (500x to 2000x by weight relative 
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to humans), resulting in lower levels of intracel- 
lular free radicals and consequently less DNA 
damage and mutations (73). However, our esti- 
mate of Upgp in baleen whales is similar to that 
observed in primates, which have much higher 
cancer rates. Barring the possibility that DNA 
damage repair mechanisms differ substantially 
between somatic and germline cells in baleen 
whales (28), our result thus negates the notion 
that the lower observed cancer rates in gigantic 
mammals are due solely to lower mutation rates, 
ie., the cause of Peto’s paradox (14, 29). 


Mitochondrial mutation rate 


In addition to the nucleus, mammal cells also 
contain maternally transmitted mitochon- 
dria, which have their own genomes. The fast- 
evolving control region in the mt genome is 
one of the most common genetic markers em- 
ployed in population genetics. Contemporary 
levels of genetic diversity in baleen whales have 
been used to infer unexpectedly high preex- 
ploitation abundances of North Atlantic baleen 
whales (18). These genetic assessments of the 
impacts of whaling were based on upyy. The 
much smaller size of the mt genome (~16,500 
base pairs) implies few, if any, de novo muta- 
tions in each transmission, and thus the above 
approach to infer upgp is infeasible for the mt 
genome in most species. However, Uprp for the 
mt genome can be inferred from the frequency 
of heteroplasmy and the change in the ratio of 
heteroplasmy between females to their off- 
spring (3, 30), an approach so far applied to a 
few model species (37-33) and only a single 
wild population of penguins (3). 

We assessed mt heteroplasmy using data 
from long-term research on North Atlantic 
humpback whales in the Gulf of Maine since the 


1 September 2023 


B 


250,000 49: 36x10-6 
200,000 4 
150,000 5 


0.93x10% 
mA 4.3x10-° 
0-4 


abundance 


100,000 4 
50,000 4 


North Atlantic humpback whale 


Phylogenetic Pedigree Historical ee Pelee ieee! eooes 93 


(dark gray and black) and nongenetic (light gray) estimates of preexploitation and 
contemporary abundance of North Atlantic humpback whales; genetic estimates based 
on mt ppyy scaled to a generation time at 18 years used in 


he original analysis. 


1970s. Among 850 sampled humpback whales, 
we identified 141 distinct maternal lineages, 
characterized by 20 different mt haplotypes. 
Each maternal lineage contained between 2 
and 49 whales and spanned from one to four 
generations. De novo mutations in the mt 
control region were evident as heteroplasmy, 
detected initially by Sanger sequencing (34) 
and subsequently confirmed by massive par- 
allel sequencing (Fig. 3). Heteroplasmy was 
detected in the oldest known female in nine 
maternal lineages and in a minimum of one 
descendant, confirming the transmission of 
heteroplasmy along the maternal germline. In 
total, heteroplasmy was detected in one, three, 
and five lineages at nucleotide positions 235, 
258, and 82, respectively (table S8). The hetero- 
plasmy detected at position 82 was identified 
in three different mt haplotypes, supporting 
the well-known presence of mutational hot- 
spots with elevated mutation rates in the mt 
control region (19, 35). The heteroplasmy ob- 
served in seven lineages gave rise to mt control 
region haplotypes already present in the pop- 
ulation (table S8). These de novo mutations 
could go unaccounted for in estimates of tpyy 
(19). The mt control region haplotypes result- 
ing from the heteroplasmy in the remaining 
two lineages were novel in this population. 
We inferred upgp from the generational 
change in the frequency of each variant at the 
heteroplasmic site, as per Hendy and colleagues 
(30) (Fig. 3), from massive parallel sequenc- 
ing data with an average read depth of 6900x. 
In total, 47 maternal transmissions of a germ- 
line mt heteroplasmy yielded a median esti- 
mate of the number of segregating units at 
10.4 [interquartile range (IQR): 5 to 28.9; fig. 
S7 and table $9] during the oocyte bottleneck. 
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The number of segregating units was very sim- 
ilar to the range of values reported in other 
mammals (3, 31-33, 36). The final estimate of 
Upgp for the mt control region was 4.3 x 1o® 
(IQR: 1.55 x 10° to 8.85 x 10~°; table S6), sim- 
ilar to comparable estimates of upgp for the 
same mt region in humans (3.8 x 107°; IQR: 
1.18 x 10~6 to 1.16 x 107°) (32). The single tanc 
estimate from baleen whales [bowhead whale 
(4)] was 7 x 10~° to 1.05 x 107° (depending on 
the assumed generation time; table S6), but a 
subsequent assessment by the authors deemed 
the estimate unreliable (37). 

We assessed the effect of our estimated val- 
ue Of Upp in the mt control region on previous 
estimates of preexploitation abundance of 
North Atlantic humpback whales inferred from 
contemporary levels of genetic diversity in the 
mt control region (18, 19, 38). Insights into the 
preexploitation abundance of baleen whales 
are not only important when assessing current 
levels of recovery but also provide fundamen- 
tal insights into the overall state of the historic 
oceans and the biomass that the oceans sup- 
ported before overexploitation. To evaluate 
the effect of using our estimate of upgp, we 
replicated the approach and the parameter 
values used in the previous studies (78, 19), 
with the exception of upgp. The resulting esti- 
mate of the effective female population size 
was 2561 (IQR: 1245 to 7128), yielding an over- 
all effective population size of 5122, which, in 
turn, translated to an overall census abun- 
dance of 20,488 humpback whales (IQR: 9964 
to 57,030). This newly calculated estimate is 
86% lower than the lowest upyy estimate 
based on mt control region sequence diversity 


Fig. 3. Changes in the frequencies of a mitochondrial point heteroplasmy 
after the transmission bottleneck during oocyte development. Heteroplasmy 
present in a humpback whale and three of her offspring detected by Sanger 
sequencing (chromatogram peak height, left-hand square) and massive parallel 


Suarez-Menéndez et al., Science 381, 990-995 (2023) 


[150,000, credited to (19) by (38)]. The abun- 
dance based on our estimate of upp is sim- 
ilar to nongenetic estimates of preexploitation 
humpback whale abundance [17,151 to 22,647 
(20, 38)] (Fig. 2B). Our result demonstrates 
how a downward-biased estimate of 1, inferred 
from Upyzy, leads to an overestimate of preex- 
ploitation abundance. The revised upgp-based 
estimate is much closer to the most recent 
estimate of the abundance of North Atlantic 
humpback whales [10,752 (39)]. Accordingly, 
our revised estimate implies that the pre- 
exploitation oceans likely supported a lower 
biomass of baleen whales than suggested by 
the earlier genetic studies. However, estimat- 
ing historic whale abundance in this manner 
is subject to several uncertainties, such as mi- 
gration, population size changes, and generation 
time (5). In its simplest form, the contempo- 
rary level of genetic diversity represents the 
temporal harmonic mean of the effective pop- 
ulation size and thus not necessarily the actual 
abundance at a specific, narrowly defined time 
period—in this case, immediately before the 
onset of whaling (5). Therefore, we also applied 
a linkage disequilibrium-based approach to 
the data obtained from whole genomes of the 
unrelated individual humpback whales to es- 
timate the effective population size before the 
start of commercial whaling (30 generations 
ago). This estimation yielded an average effec- 
tive population size of 5700 individuals [stan- 
dard deviation (SD): 685], which was stable at 
an average of 5263 (SD: 1104) between 30 and 
200 generations ago and thus very similar to 
our revised estimate based on upgp estimated 
for the mt control region. However, and per- 


haps most importantly, our reanalysis demon- 
strates how sensitive these inferences are to 
changes in the underlying parameter values 
(5), which also applies to nongenetic estimates 
of historic abundance (20). 


Conclusions 


The rate and nature of de novo mutations are at 
the center of evolution itself and key to many 
ecological and evolutionary processes. Differ- 
ences in 1 among taxonomic groups have fos- 
tered an array of intriguing hypotheses aimed 
at understanding the underlying causes (J). 
However, the technical caveats in the estima- 
tion of Uppy and Uanc render direct compar- 
isons among different species and studies 
difficult. In contrast, estimates of Upp are di- 
rectly comparable among species, and conse- 
quently, this approach holds great potential to 
facilitate and improve such comparisons. Our 
study expands on recent reports of pedigree- 
based estimation of p in two crucial aspects. 
First, we show that it is feasible to estimate 
Upp in wild populations in a cost-effective 
manner by conducting an initial screening of 
samples using inexpensive microsatellite data. 


We show that trios can be reliably detected ‘ 


and confirmed using whole-genome sequenc- 
ing in the absence of known pedigrees, even in 
notoriously difficult-to-study large mammals 


with exceptionally wide ranges. Second, our ‘ 


estimates of upgp in the gigantic baleen whales 
were similar to estimates of upgp in the com- 
paratively smaller-bodied primates and toothed 
whales, both for the nuclear and mt genome, 
thus negating a major effect of gigantism on i. 
We note that upgp estimates are not necessarily 
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sequencing (read frequencies, right-hand square). The relative frequencies of the 
two mt haplotypes were reversed in offspring 2 compared with the mother. In 
offspring 3, the heteroplasmy was undetectable by Sanger sequencing but detected 
among the reads. [Whale illustrations by Frédérique Lucas] 
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directly applicable as a substitute for p in all 
instances. This is particularly the case when 
results are based on analytical methods that 
integrate over extensive time periods, when 
populations are subject to changes in abun- 
dance and connectivity, selection, and multi- 
ple substitutions at the same nucleotide sites. 
As an example, in some cases, robust infer- 
ences require applying a mutation model that 
accounts for multiple substitutions, which 
may result in a “time dependence” of mutation 
rates (40). However, as sample sizes increase 
in studies aiming to detect de novo mutations 
in pedigrees, a much more detailed insight will 
be gained into both the nature and distribu- 
tion of de novo mutations across the genome. 
In turn, these insights will enable refinement 
of current mutation models and thereby aid in 
improving evolutionary and population ge- 
netic inference methods in general. We also 
note that while the generational mutation 
rate might be similar among larger mammals, 
generation times can differ substantially among 
species, resulting in lower annual mutation 
rates in long-lived species. This generation 
time effect comes into play when inferences 
based on upgp require conversion into years 
(e.g., historic abundances and population di- 
vergence times). Given the relative ease with 
which trios can be identified in large sample 
sizes by inexpensive microsatellite genotyp- 
ing and given the low costs of whole-genome 
sequencing, robust estimates of Upgp are likely 
to become available across a wide range of 
biodiversity, facilitating novel insights into 
many different fundamental and applied as- 
pects that relate to u. 
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CALCULUS INSTRUCTION 


Establishing a new standard of care for calculus 
using trials with randomized student allocation 


Laird Kramer’2*, Edgar Fuller’**, Charity Watson’*4, Adam Castillo'**+, 


Pablo Duran Oliva'?>, Geoff Potvin’? 


Calculus, the study of change in processes and systems, serves as the foundation for many STEM disciplines. 
Traditional, lecture-based calculus instruction may present a barrier for students seeking STEM degrees, 
limit their access to STEM professions, and block their potential to address society's challenges. A large-scale 
pragmatic trial with randomized student allocation was conducted to compare two calculus instruction 
styles: active student engagement (treatment condition) versus traditional, lecture-based instruction (control 
condition). A sample of 811 university students were studied across 32 sections taught by 19 instructors 
over three semesters at a large, US-based Hispanic-serving institution. Large effect sizes were consistently 
measured for student learning outcomes in the treatment condition, which demonstrates a new standard 
for calculus instruction and increased opportunities for completion of STEM degrees. 


alculus instruction needs substantial trans- 
formation because it is often a barrier to 
STEM degree attainment, especially for 
traditionally underrepresented groups 
(1-3), depriving both individuals and so- 
ciety of the potential benefits of their inclu- 
sion. National calls for calculus transformation 
are numerous (4, 5), because failing calculus 
can contribute to a student’s departure from 
STEM degree programs. Only ~40% of stu- 
dents entering universities with STEM degree 
intentions actually graduate with a STEM de- 
gree (6). More concerning is that the odds of 
female students switching out of a STEM pro- 
gram after a calculus course is ~1.5 times higher 
than that of comparable male students (3). Fur- 
thermore, Hispanic and Black/African American 
students had >50% higher failure rates than 
white students in calculus (7, 8). 
Evidence-based instruction, which is imple- 
mented in many STEM disciplines, has reliably 
led to profound improvement in student suc- 
cess (9-11). However, common approaches to 
calculus instruction continue to rely on tradi- 
tional, lecture-based practices in which students 
are passive learners in the classroom and are 
expected to construct their knowledge mostly 
outside of the classroom by doing homework or 
in recitation sessions (12). Mathematics as a dis- 
cipline thus needs to embrace its role in enabling 
STEM careers that will lead to prosperity for 
both individuals and society at large. “Calculus ... 
must become a pump and not a filter” for the 
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STEM pipeline, as noted in 1988 by Robert 
White, president of the National Academy of 
Engineering (13). Handlesman et al. (14) re- 
cently argued that “we must fix the classrooms 
where many students from historically excluded 
communities are discouraged from pursuing 
STEM” and that “... the continued exclusive use 
of lectures is malpractice at best, or an act of 
discrimination at worst.” Thus, it is imperative 
that substantial transformation in calculus in- 
struction takes place to promote more equita- 
ble learning environments for all students. 
We present a large-scale trial featuring ran- 
domized student allocation into treatment or 
control conditions to rigorously compare an 
evidence-based, active student engagement 
calculus course against traditional, lecture-based 
calculus instruction. The work extends prior cal- 
culus research investigations (15-17) by includ- 
ing random assignment of students to treatment 
and control sections, as well as anonymized 
analysis of the identical end-of-semester learn- 
ing outcomes. The study uses a pragmatic (18) 
design with random allocation of students to 
inform on the effectiveness of similar inter- 
ventions at higher education institutions, re- 
flecting real-world classroom constraints. In 
these contexts, blinding of the treatment and 
control conditions to both students and faculty 
is not possible because blinding is only feasible 
when the treatment and control conditions re- 
main unknown to the participants during the 
period of study (such as in a clinical trial drug 
study). As with other public health or sociologi- 
cal interventions, enrollment of participants 
in this study revealed some aspects of a cohort 
structure, but it was still possible to maintain 
the essential aspects of random assignment by 
following a modified protocol, as was done in 
Zwarenstein et al. (18). The treatment condi- 
tion integrated a suite of coherent strategies 
that have been independently found (79, 20) 
to improve student learning; thus, the treat- 
ment was a distinct departure from tradition- 
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al instruction, and it was not logically Poss chee 
for the treatment condition to remain hic. 
from students or faculty after the treatment be- 
gan. Random assignment of faculty to control 
or treatment conditions would not be possible 
because an individual faculty member’s knowl- 
edge, philosophy, and experience with a variety 
of classroom strategies and instructional prac- 
tices may intersect with the features of the treat- 
ment or control conditions. The experimental 
protocol thus included a group of instructors 
willing to adopt the instructional methods in 
the treatment condition. This comprehensive 
experimental approach was intended to secure 
the strongest possible evidence for critical stake- 
holders to sustain the treatment beyond the trial. 

The treatment condition used the modeling 
practices in calculus (MPC) curriculum and 
pedagogy, and the control condition represented 
the preexisting, traditional instructional prac- 
tices at the study institution. MPC integrates 
the practices of mathematicians as a central 
design tenet throughout the course. Instructors 
facilitate students’ application of mathematical 
“habits of mind” (27) that foster deeper under- 
standing of calculus concepts, including the 
identifying of patterns, hypothesis development ‘ 
and testing, making connections, and communi- 
cating ideas precisely to learn calculus through- 
out the course. Class time is devoted to students 
working collectively in small groups on prede- ‘ 
signed notes and learning activities developing 
their calculus understanding with minimal lec- 
turing. Treatment included learning assistants 
(22) who were undergraduate peers integrated 
within the instructional team to facilitate student 
learning and promote culturally responsive ins- 
truction. The curriculum promotes mathemati- 
cal practices (sense-making, problem solving, 
argumentation, etc.) and established strategies 
to optimize student engagement, including coop- + 
erative learning, argumentation and metacogni- ‘ 
tion, mathematical fluency, and a culturally 
responsive environment (23) (described in the 
supplementary materials section 2). The MPC - 
design builds on the SCALE-UP calculus model 
(24) and intentionally embodies well-established 
recommendations for calculus instruction, in- 
cluding ambitious teaching practices and strate- 
gies promoted by national mathematics societies 
and national reports (12, 20, 25-28). 

The study was performed at Florida Interna- 
tional University (FIU) in Miami, Florida, the 
fourth-largest public research university in the 
United States, with 58,787 students, of which 
41,795 are undergraduates as of fall 2019 (29). 
FIU is a Hispanic-serving institution, with 64% 
of students identifying as Hispanic/Latino/a/. 
Moreover, 79% of the students identify as mem- 
bers of historically underrepresented racial/ 
ethnic minority groups, and 57% are women. 
The institution’s size provided a unique oppor- 
tunity to perform this study because there are 
18 to 34 40-student sections of calculus 1 being 
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taught each semester and primarily serving 
STEM majors. Furthermore, institutional con- 
ditions created urgency to transform calculus 
because historic pass rates in introductory cal- 
culus averaged 55% (range 13 to 88%) over the 
six semesters before the project’s pilot. 


Research design 


A pragmatic trial (30-32) of the MPC approach 
was performed during the fall 2018, spring 
2019, and fall 2019 semesters to rigorously test 
student outcomes. Students were randomly 
assigned individually to treatment and control 
conditions at the beginning of the semester 
after enrolling in sections on the basis of their 
scheduling preferences using the institution’s 
enrollment system. To accommodate the ran- 
domized assignments, each of the experimen- 
tal sections doubled in size from the usual 40 
seats to 80 seats before enrollment opening. 
Instructor names and section sizes were invis- 
ible to students throughout the enrollment 
phase. Just before each term, the 80-seat sec- 
tions were split into two 40-seat sections by 
assigning each student at random to either a 
treatment or control section. 

Once assigned, the treatment sections im- 
plemented the MPC approach, whereas the 
control sections were unchanged. After assign- 
ment, students were free to change/drop/add 
course sections up until the regular institutional 
drop/add deadline (7 days after classes began). 
To account for such changes, enrollments were 
monitored, and only students who were ran- 
domly assigned to either a treatment or control 
section and remained in that section through 
the regular, nonpenalty drop/add deadline were 
included in the data for the experimental study 
reported below. In total, 1019 students were 
randomly assigned to either the treatment or 
control groups. Of these, 516 students were 
assigned to the treatment group, and 417 re- 
mained in the section at the drop/add deadline. 
At the same time, 503 students were assigned 
to the control group and 394: students remained 
in the section at the drop/add deadline. Our 
study followed the Consolidated Standards 
for Reporting Trials (CONSORT) guidelines 
(18, 33, 34). The specifics of recruiting, enroll- 
ment, assignment, and completion for the trial 
are discussed in sections 1.3 and 3.1 of the 
supplementary materials. The randomization 
process produced comparable groups by mathe- 
matical background and demographics; class 
sizes were typical for the course (see section 3 
of the supplementary materials). 

Faculty participating in the study included 
seven individuals teaching 16 treatment sections, 
along with 12 individuals teaching 16 control 
sections. Faculty recruited to teach the treat- 
ment sections indicated a willingness to adopt 
and implement the MPC approach, replicating 
the authentic condition of faculty reforming 
their classroom practice under the study design. 
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To prepare for the new instructional approach, 
faculty participated in a 2-day, pre-semester 
professional development workshop and were 
provided with the MPC curricular materials. 
Consistency of the MPC treatment was moni- 
tored through weekly preparation meetings in 
which the course objectives and pacing were 
discussed. In-class monitoring by the project 
team was deemed overly intrusive and disrup- 
tive to classroom engagement. Control section 
faculty were not guided to use any particular 
practices and chose their normal instructional 
approach, best described as traditional lecture 
format with at most limited student engage- 
ment. Potential effects of instructor differences 
on learning outcomes were investigated and 
are presented in section 3 of the supplementary 
materials and summarized below. 

The student outcome measures reported in- 
clude identical end-of-semester learning mea- 
sures, as well as course success data (i.e., course 
grades). The end-of-semester learning measures 
focused on evaluating learning using a set of 
identical assessment items (problems) developed 
by instructors spanning all calculus sections and 
spanning the major learning objectives of a cal- 
culus 1 course. The aim was to determine how 
well students understood essential elements of, 
and exhibited fluency and technical competency 
in, calculus at course end. Assessment items 
aligned to both local and national standards (35) 
were embedded in a cumulative final examina- 


Effect Sizes for End of Semester Learning Measures by Group 
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Fig. 1. Overall end-of-semester learning mea- 
sures effect sizes (Cohen’s d) broken out by 
major, race/ethnicity, and gender. Error bars 
indicate the 95% Cl for effect size for each group. 
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Fig. 2. Overall course success (earned grades of 
A, B, or C) effect sizes (Cohen’s d) broken out 

by major, race/ethnicity, and gender. Error bars 
indicate the 95% Cl for effect size for each group. 
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tion and were administered to all students in 
each treatment and control section. To ensure 
fidelity and fairness to both treatment and 
control sections, control and treatment faculty 
collaboratively developed a set of items to be 
administered to both conditions in identical for- 
mat and wording. This set of identical items 
formed roughly two-thirds of the total final exa- 
mination content, with the remaining items 
added by individual faculty in a separate section 
of the examination, allowing them to address 
their specific instructional goals. Furthermore, 
the examinations and problems were formatted 
identically and without course section identifiers 
to allow completely anonymized evaluation dur- 
ing the subsequent comparative analysis. The 
identical items covered core calculus topics in- 
cluding evaluating limits, identifying extrema, 
curve sketching, related rates, and evaluating 
indefinite integrals. For the second and third 
semesters, additional items focusing on implicit 
differentiation and optimization were added 
to the identical set of items (for details, see sec- 
tion 3.3 of the supplementary materials). Course 
success data (grades) reflect the overall assess- 
ment of students as assigned by each section’s 
instructor. Course grade policies were estab- 
lished by individual instructors following depart- 
mental syllabus guidelines and were broadly 
consistent across sections and semesters. 

Analysis of the end-of-course learning mea- 
sures used a rubric for each problem, with five 
researchers testing the initial rubric on a subset 
of examinations to establish interrater reliability. 
The final rubric represented consensus on all 
elements and accounted for initial ambiguity 
or disagreement. The analysis was performed by 
ateam of 10 trained evaluators, each of whom 
evaluated a completely anonymous set of stu- 
dent solutions. An average of two evaluators 
reviewed each solution for correctness on a 
scale from 0 to 100%. The evaluators were very 
consistent and interrater reliability was high 
(Cohen’s kappa 0.827 in fall 2018 and 0.797 in 
spring 2019) (36, 37). The same rubric was ap- 
plied to the fall 2019 data given its high degree 
of agreement. Once all problems were evaluated, 
the research team deanonymized and sorted the 
results by treatment and control sections for 
the comparative analysis. 


Results 


The results indicate significant improvements 
in student learning for the MPC group across 
all three semesters. Students in the treatment 
group showed substantially higher scores on 
the identical end-of-semester learning outcomes: 
(fall 2018: d = 0.505, P < 0.01; spring 2019: d = 
0.748, P < 0.001; fall 2019: d = 0.925, P < 0.001) 
compared with the control group. Combining 
results from all three semesters of trials (i.e., 
32 sections and 811 total students), the overall 
standardized mean difference between treat- 
ment and control was d = 0.774 [95% confidence 
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interval (CI) = 0.618 to 0.930] at the individual 
level, a medium/large effect size (36, 37). An ad- 
justed effect size (38) was computed using section- 
level cluster properties and the mixed-effects 
model structure described below and was found 
to be dy = 0.771 (95% CI = 0.468 to 1.073; see sec- 
tion 3.4.2.1 of the supplementary materials). 

The success of the MPC intervention could 
be seen across racial and ethnic groups, majors 
and academic pathways, and genders (Fig. 1). 
Similar medium/large overall effect sizes were 
observed for students in the treatment condi- 
tion who identified as Black/African American 
(d = 0.882, P < 0.001) or Hispanic/Latino/a/ 
(d = 0.772, P < 0.001) when directly comparing 
the identical learning measures with their coun- 
terparts in the control condition. Although all 
STEM majors showed significantly improved 
learning, there were larger effect sizes for biology 
majors in the treatment group (d = 0.925, P < 
0.001). Students matriculating onto campus as 
both first time in college (FTiC) and transfer 
students showed medium/large effect sizes and 
most were FTIC. Overall, treatment group stu- 
dents show more consistency in applying the 
tools of calculus to optimization problems, using 
derivatives to sketch graphs of functions, and 
in the evaluation of limits and integrals. 

Potential biases arising in the random stu- 
dent assignment and faculty selections were 
investigated for hidden-level effects or confound- 
ers to establish limitations of the study (see sec- 
tion 3 of the supplementary materials). Random 
allocation of students provided equivariance in 
the demographics of student populations. Analy- 
ses showed that allowing students to drop/add 
sections during the open registration period 
after the initial assignment did not affect the 
measured outcomes. Faculty characteristics were 
compared and found to be similar in both back- 
ground and prior course student grade distribu- 
tions. A mixed-effects model with student fixed 
effects and random cluster effects due to section 
and instructor levels was fit with tests of fixed 
effects computed using Satterthwaite approxi- 
mations (see section 3.2.4.1 of the supplemen- 
tary materials). The explanatory power of the 
model was found to be high (conditional R? = 
0.39, marginal R” = 0.31). The effect of treat- 
ment was statistically significant (¢660) = 5.68, 
P< 0.001, semipartial R? = 0.119) (39), implying 
an estimated effect size of Cohen’s f = 0.368 
with covariates and cluster-level effects present. 
Random effects correlated with 0.085 of outcome 
variance (intraclass correlation coefficient = 0.13 
with demographic covariates). A sensitivity anal- 
ysis (see section 3.2.4.3 of the supplementary 
materials) showed that unmeasured confound- 
ers would need to be four times more powerful 
than any measured covariate, including student 
mathematics background, to be responsible for 
the observed effect. 

Furthermore, students in the MPC treatment 
condition had improved course grades. Average 
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grades were significantly higher by ~0.4 points 
(4.0 grade point scale) in MPC sections across 
all semesters of the study (P < 0.001, d = 0.295). 
This translated to success rates (A, B, or C grades) 
averaging 11% higher in MPC sections compared 
with traditional sections (P < 0.001, d = 0.251; 
Fig. 2). Outcomes were consistent across the 
three semesters of the experiment (Fig. 3). More- 
over, the MPC sections also had lower course 
late drop rates (departure after the regular drop/ 
add period ends) across all three semesters 
(P < 0.05, d = 0.141), suggesting that students 
more clearly perceived that they were likely to 
succeed in the course. 

The trend of improved outcomes in course 
success was also observed for demographic 
subgroups, as can be seen in Fig. 2. A logistic re- 
gression model of success using gender identifi- 
cation, FTiC status, and Hispanic identification 
as independent variables showed the odds of a 
female-identified student in the treatment group 
passing the course to be 58% higher than the 
odds of a female-identified student in the control 
group (0, = 0.46, P < 0.05). Hispanic students’ 
odds of passing the course were almost double 
that of their counterparts in the control group 
(0, = 0.70, P < 0.001). The likelihood of FTiC 
students in the treatment group passing the 
course increased by ~85% compared with FTiC 
students in the control group (0; = 0.61, P < 0.01) 
(for details, see section 3.4 of the supplementary 
materials). 


Discussion and conclusions 


This pragmatic trial demonstrates that stu- 
dent learning outcomes were significantly im- 
proved in the treatment condition. Contrary to 
previous research (40), this study shows that 
when students are expected to engage with 
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Percent of Student Success (A/B/C), Fail, and Late 
Drop by Term and Assignment Group with Std Error 


Control Treatment 
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Spring 2019 


iLL 


Control 


calculus concepts collaboratively using inten- 
tional, evidence-based teaching strategies, they 
develop a better understanding of calculus con- 
cepts and techniques. The benefits of the MPC 
curriculum and pedagogy were realized regard- 
less of racial/ethnic group, gender, or major/ 
academic pathway. These trends suggest that 
the treatment includes culturally responsive and 
equitable strategies. Specifically, the MPC learn- 
ing environment is designed to promote learn- 
ing communities that provide ongoing support 
for learning mathematics through collaborative 
engagement and ongoing formative feedback. 
This aims to promote inclusion and increases 
access for students with different mathematical 
backgrounds, cultural identities, and life expe- 
riences by allowing them to use their mathe- 
matics skills in a supportive, nonthreatening 
environment. 

The improved learning and course success 
for modeling practices in calculus reported in 
this study have profound implications for cal- 
culus instruction. This study demonstrates the 
substantial benefit to students of the MPC ap- 
proach designed around established, evidence- 
based principles, and should motivate educators 
in mathematics and other STEM disciplines to 
adopt the same or similar approaches and 
conduct similar studies to replicate these find- 
ings. Improved student success also leads to 
more efficient student progress to graduation 
and boosts institutional effectiveness. Applying 
this study’s 11% average improvement in pass 
rate to all 2000 first-time calculus students at 
FIU would translate to 220 additional students 
succeeding in calculus annually and reducing 
the instructional load by five sections annually. 
Extending this strategy to the ~300,000 stu- 
dents across the nation taking calculus 1 each 
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Outcome 


ial Late Drop 
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Fig. 3. Final course grade outcomes broken out by term and curriculum, including success (earned 
grades of A, B, or C), fail (earned grades of D or F), and late drops (withdrawals or drops after the 
institution’s drop/add deadline). Vertical scale is percentage by outcome. Error bars indicate the 95% Cl 
for the mean percentage of students in each outcome group over all sections in a term. 
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year, these results translate to the potential for 
an additional 33,000 students passing calculus 
each year, saving students an estimated $23.9 
million on tuition [based on a three-credit course 
at the average public college/university tuition 
rate of $242/credit (41, 42)]. Pragmatic trials 
provide guidance on what can be achieved by 
engaging faculty willing to change their ins- 
truction. These results potentially represent a 
lower bound on the long-term effects because 
faculty likely develop additional expertise 
through continued instruction and realize im- 
proved outcomes. The measured effect size pro- 
vides the rationale to stop the control due to 
treatment benefit if one follows medical re- 
search protocols (43, 44). 


The experimental methodology described here 


establishes a new standard of care for calculus 
instruction and a high standard of evidence to 
bear on understanding the impacts on student 
learning. Improved learning of calculus aims to 
foster higher success in future STEM courses 
and develop the STEM “habits of mind” that stu- 
dents take with them into their future careers. 
Further, MPC shows the potential to address 
the disparities that differentially affect histori- 
cally underrepresented groups, thus offering a 
mechanism to address Handelsman et al.’s 
(14) call to promote the success of historically 
excluded communities. We envision a mathe- 
matics experience for all students built on this 
approach and recommend that active student 
engagement must be deployed across all STEM 
disciplines to improve our development of fu- 
ture STEM professionals from all backgrounds. 
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A principal odor map unifies diverse tasks in 


olfactory perception 


Brian K. Lee*+, Emily J. Mayhew™*t, Benjamin Sanchez-Lengeling', Jennifer N. Wei’, 
Wesley W. Qian*”, Kelsie A. Little?, Matthew Andres’, Britney B. Nguyen’, Theresa Moloy”, 
Jacob Yasonik™, Jane K. Parker®, Richard C. Gerkin**”, Joel D. Mainland”**, Alexander B. Wiltschko*** 


Mapping molecular structure to odor perception is a key challenge in olfaction. We used graph 

neural networks to generate a principal odor map (POM) that preserves perceptual relationships and 
enables odor quality prediction for previously uncharacterized odorants. The model was as reliable as a 
human in describing odor quality: On a prospective validation set of 400 out-of-sample odorants, the 
model-generated odor profile more closely matched the trained panel mean than did the median 
panelist. By applying simple, interpretable, theoretically rooted transformations, the POM outperformed 
chemoinformatic models on several other odor prediction tasks, indicating that the POM successfully 
encoded a generalized map of structure-odor relationships. This approach broadly enables odor 
prediction and paves the way toward digitizing odors. 


fundamental problem in neuroscience 
is mapping the physical properties of a 
stimulus to perceptual characteristics. In 
vision, wavelength maps to color; in audi- 
tion, frequency maps to pitch. By con- 
trast, the mapping from chemical structures to 
olfactory percepts is poorly understood. Detailed 
and modality-specific maps such as the Com- 
mission Internationale de lElcairage (CIE) color 
space (7) and Fourier space (2) led to a better 
understanding of visual and auditory coding. 
Similarly, to better understand olfactory cod- 
ing, the field of olfaction needs a better map. 
Pitch increases monotonically with frequen- 
cy. By contrast, the relationship between odor 
percept and odorant structure is riddled with 
discontinuities; this is exemplified by Sell’s 
triplets (3), which are trios of molecules in 
which the structurally similar pair is not the 
perceptually similar pair (Fig. 1A). These dis- 
continuities in the structure-odor relation- 
ship suggest that standard chemoinformatic 
representations of molecules—functional group 
counts, physical properties, molecular finger- 
prints, and so on—that have been used in re- 
cent odor modeling work (4-6) are inadequate 
to map odor space. 


The principal odor map represents perceptual 
distances and hierarchies 

To generate odor-relevant representations of 
molecules, we constructed a message passing 
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neural network (MPNN) (7), which is a spe- 
cific type of graph neural network (GNN) (8), 
to map chemical structures to odor percepts. 
Each molecule was represented as a graph, 
with each atom described by its valence, de- 
gree, hydrogen count, hybridization, formal 
charge, and atomic number. Each bond was 
described by its degree, its aromaticity, and 
whether it is in a ring. Unlike traditional fin- 
gerprinting techniques (9), which assign equal 
weight to all molecular fragments within a 
set bond radius, a GNN can optimize fragment 
weights for odor-specific applications. Neural 
networks have unlocked predictive modeling 
breakthroughs in diverse perceptual domains 
[e.g., natural images (10), faces (17), and sounds 
(12)] and naturally produce intermediate rep- 
resentations of their input data that are func- 
tionally high-dimensional, data-driven maps. 
We used the final layer of the GNN (henceforth, 
“our model”) to directly predict odor qualities, 
and the penultimate layer of the model as a prin- 
cipal odor map (POM). The POM (i) faithfully 
represented known perceptual hierarchies and 
distances, (ii) extended to out-of-sample (here- 
after, “novel”) odorants, (iii) was robust to dis- 
continuities in structure-odor distances, and 
(iv) generalized to other olfactory tasks. 

We curated a reference dataset of ~5000 mol- 
ecules, each described by multiple odor labels 
(e.g., creamy, grassy), by combining the Good 
Scents (73) and Leffingwell & Associates (74) 
(GS-LF) flavor and fragrance databases (Fig. 
1B). To train our model, we optimized model 
parameters with a weighted-cross entropy loss 
over 150 epochs using Adam (15) with a learn- 
ing rate decaying from 5 x 10* to1 x 10° and 
a batch size of 128. The GS-LF dataset was split 
80/20 training/test, and the 80% training set 
further subdivided into five cross-validation 
splits. These cross-validation splits were used 
to optimize hyperparameters using Vizier (16), 
a Bayesian optimization algorithm, by tuning 
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across 1000 trials. Details about model al Se 
tecture and hyperparameters are givel. 
the supplementary methods. When properly 
hyperparameter-tuned, performance was found 
to be robust across many model architectures. 
We present results for the model with the high- 
est mean area under the receiver operating 
characteristic curve (AUROC) on the cross- 
validation set (AUROC = 0.89) (17). 

How does the POM compare to perceptual 
odor maps and conventional structure-based 
maps of odorants? Empirical perceptual space 
(Fig. 1D) intuitively represents perceptual dis- 
tances (e.g., two molecules that smell of jas- 
mine should be nearer to each other than to a 
beefy-smelling molecule) and hierarchies (e.g., 


jasmine and lavender are subtypes of the floral 


odor family). We show that whereas this struc- 
ture is lost in Morgan fingerprint-based maps 
of odorant space (Fig. 1E), the POM preserves 
relative perceptual distances and hierarchies 
(Fig. IF and figs. S1 to $3). 


Our model outperformed the median panelist 
on the prospective validation task 


To test whether our model extends to novel 
odorants, we designed a prospective valida- ‘ 
tion challenge (78) in which we benchmarked 
model predictive performance against indi- 
vidual human raters. In olfaction, no reliable 
instrumental method of measuring odor percep- * 
tion exists, and trained human sensory panels 
are the gold standard for odor characterization 
(19). Odor perception is variable across indi- 
viduals (20, 27), but group-averaged odor rat- 
ings are stable across repeated measurements 
(22) and represent our best avenue to estab- 
lish the ground-truth odor character for novel 
odorants. We trained a cohort of subjects to 
describe their perception of odorants using 
the rate-all-that-apply method (RATA) anda « 
55-word odor lexicon. During training sessions, ‘ 
each term in the lexicon was paired with visual 
and odor references (fig. S4 and table S1). Only 
subjects that met performance standards on - 
the pretest of 20 common odorants (data S2; 
individual test-retest correlation R > 0.35; rea- 
sonable label selection for common odorants) 
were invited to join the panel. 

To avoid trivial test cases, we applied the fol- 
lowing selection criteria for the set of 400 novel 
odorants: (i) molecules must be structurally 
distinct from each other (fig. S5), (ii) mole- 
cules should cover the widest gamut of odor 
labels (data S1), and (iii) molecules must be 
structurally or perceptually distinct from any 
training example (e.g., Fig. 1A and data SI). 
Our prospective validation set consisted of 55- 
odor label RATA data for 400 novel, intensity- 
balanced odorants generated by our cohort of 
215 panelists (two replicates). Summary sta- 
tistics and the correlation structure of the hu- 
man perceptual data are presented in figs. S6 
to S8. Our panel’s mean ratings were highly 
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Fig. 1. The POM preserves the structure of odor perceptual space. 

(A) Example triplet of molecules in which the structurally similar pair is not the 
perceptually similar pair. (B) The GNN was trained on a curated dataset of 
~5000 semantically labeled molecules drawn from GS-LF. One square represents 
100 molecules. Three example training set molecules and their odor descriptions 
are shown: 2-methyl-2-hexenoic acid (top), 2,5-dimethy|-3-thioisovalerylfuran 
(middle), and 1-methyl-3-hexenyl acetate (bottom). (©) Schematic illustrating the 
process of training a GNN to generate the POM. (D to F) Odorants plotted by the 
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first and second principal components (PCs) of their (D) perceptual labels from 
the GS-LF training dataset (138 labels), (E) cFP structural fingerprints (radius 
four, 2048 bit), and (F) POM coordinates (256 dimensions). Areas dense with 
molecules that have the broad category labels floral, meaty, or alcoholic are shaded; 
areas dense with narrow category labels are outlined. The POM recapitulates 
the true perceptual map, but the fingerprint map does not; note that only relative 
(not absolute) coordinates matter. Additional labels are visualized for the POM 
in fig. Sl. PCA, principal components analysis. 


stable (panel test-retest: R = 0.80, n = 15; fig. 
S9) and more consistent than the DREAM Ol- 
faction Prediction Consortium cohort’s ratings 
(6) (figs. S10 and S11). 

Of the 400 molecules characterized, 77 were 
dropped from the final prospective validation 
set because of low intensity (42) (fig. S12), re- 
dundancy (2), mistaken inclusion (1), confirmed 
contamination (19), or potential contamination 
(13) (data S1). Model performance was eval- 
uated on the remaining 323 molecules without 
model retraining. 
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To measure the model’s performance, we 
compared its normalized predictions with the 
normalized panel mean rating (Fig. 2, A and 
C). One example of raw ratings and predic- 
tions for a single molecule, representative of 
relative GNN and random forest (RF) per- 
formance and panel ratings trends, is given 
in Fig. 2; additional examples are provided 
in the supplementary materials (fig S14). Al- 
though there is considerable variation across 
molecules in the ability of both individual 
raters and the model to match the panel mean 
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ratings, the model output comes closer to the 
panel mean than does the median panelist 
for 53% of molecules (Fig. 2, E and F). No- 
tably, panelists were able to smell each odorant 
as they rated it, whereas the model’s predic- 
tions were based solely on nominal molec- 
ular structure. 

As a baseline comparison, we trained a count- 
based fingerprint (cFP)-based RF model, the pre- 
vious state-of-the-art approach (6), on the same 
dataset (Fig. 2B). This baseline model surpassed 
the median panelist for only 41% of molecules. 
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Fig. 2. The GNN model displays human-level odor description performance. 
(A) GNN model label predictions, (B) RF model label predictions, (C) pane! mean 
ratings with standard error bars, and (D) individual panelist ratings (where 0 means the 
label does not apply and 5 means the label is very applicable), averaged over two 
replicates, for the molecule 2,3-dihydrobenzofuran-5-carboxaldehyde. In (A) to (C), the 
top-five-ranked descriptors are in orange (GNN), purple (RF), or green (panel). 
Descriptors in (A) to (D) are ordered by panel mean ratings. (A) to (D) are annotated 
with the Pearson correlation coefficient of their data to the panel mean rating shown in 
(C). Included in (D) are the panelist-panel correlation coefficients for the panelist 
that best matches the panel mean and for the panelist with the median match. 

(E) Cumulative density plot showing the distribution of correlations between human 
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panelists and the panel mean (green) and between the GNN, RF, and GNN-shuffled 
model predictions and the panel mean on a per molecule basis. Curves shifted 
to the right are more strongly correlated to the panel mean. The dashed horizontal 
line indicates the median molecule; the dotted vertical lines indicate the median 
correlations. The purple and orange symbols indicate the increase and decrease in 
correlation, respectively, of the GNN and RF models relative to median human 
performance. (F) Difference in the median correlation to the panel mean relative to 
the median human subject's correlation to the panel mean for models trained using 
k-nearest neighbor (KNN) and RF, models trained on cFPs or Mordred features, 
and the GNN model. Only the GNN model has a median correlation to the panel 
mean that is higher than that of the median panelist. 
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The GNN model shows human-level perform- 
ance in aggregate, but how does it perform 
across perceptual and chemical classes? When 
we disaggregated performance by odor label, 
the model was within the distribution of hu- 
man raters for all labels except musk and sur- 
passed the median panelist for 30/55 labels 
(55%; Fig. 3A). This per-label view indicates 
that the GNN model is superior to the previous 
state-of-the-art model trained on the same data 
(paired two-tailed Student's ttest, p = 3.3 x 10~). 
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Predictive performance for a given label de- 
pends on the complexity of the structure-odor 
mapping for that label. It is thus unsurprising 
that our model performs best for labels such as 
garlic and fishy that have clear structural de- 
terminants (sulfur-containing for garlic; amines 
for fishy) and worst for the label musk, which 
includes at least five distinct structural classes 
(macrocyclic, polycyclic, nitro, steroid-type, and 
straight chain) (23, 24). By contrast, a panel- 


ist’s performance for a given label depended 
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on their familiarity with the label in the con- 
text of smell; consequently, we observed strong 
panelist-panel agreement for labels that describe 
common food smells such as nutty, garlic, and 
cheesy and weak agreement for labels such as 
musk and hay. 

Model performance also depended on the 
number of training examples for a given label; 
with enough examples, models can learn even 
complex structure-percept relationships. In gen- 
eral, our model's performance was high for labels 
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correlation of GNN (orange) and RF (purple) model predictions and panelist 
ratings (gray) to the panel mean for molecules belonging to 10 common 
chemical classes. (D) Categorization of GC-olfactometry QC results for 

50 test-set stimuli. In the (A) and (C) box plots, the center line represents 

the median, box limits are upper and lower quartiles, and whiskers are minimum 
and maximum values or 1.5 times the inner-quartile range, whichever is closer 
to the median. 


Fig. 3. Model performance is robust across structural and perceptual 
classes. (A) Correlation of GNN (orange) and RF (purple) model predictions 
and panelist ratings (gray) to the panel mean for each of the 55 odor 

labels. (B) GNN model correlation to the panel mean for each of the 55 odor 
labels plotted against the number of molecules in the training data for which 
the label applies. Circle size is proportional to the number of test-set molecules 
for which the label applies. Selected data points are annotated. (C) Mean 
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Fig. 4. The POM is robust to discontinuities in structure-odor mapping. 

(A) Example triplet of molecules in which the structurally similar pair is 

not the perceptually similar pair (i.e., “discordant”), according to the empirical 
odor labels of each molecule. Training-set descriptors (anchor) and mean 

panel ratings (novel odorants) are shown in colored text beneath the molecular 
structure; model-predicted labels are listed in black text. Structural nodes 
highlighted in darker red are those that are more important to model predictions. 
(B) We selected 41 such triplets from the empirical label data without consulting 
the model; by design, 100% of these are discordant and thus represent a difficult 
test for a predictive perceptual model based on molecular structure. Each 
colored line connects molecules in a triplet that share the same anchor, as in (C). 


with many training examples (e.g, fruity, sweet, 
floral) (Fig. 3B), but performance for labels with 
few training examples was either high (e.g., fishy, 
camphoreous, cooling) or low (e.g, ozone, sharp, 
fermented). Likewise, model performance was 
bounded above by panel test-retest correlation 
(fig. S15). When we disaggregated by chemical 
classes (e.g., esters, phenols, amines), both pan- 
elist and model performances were relatively 
uniform (Fig. 3C), with sulfur-containing mole- 
cules showing the strongest performance from 
panelists and the model (R = 0.52). 

Chemical materials are impure—a fact too 
often unaccounted for in olfactory research (25). 
To measure the contribution of impurities to 
the odor percept of our stimuli, we applied a gas 
chromatography-mass spectrometry (GC-MS) 
and gas chromatography-olfactometry (GC-O) 
quality control (QC) procedure to 50 stimuli 
(data S1). This QC procedure matches an odor 
percept to its causal molecule, which allowed 
us to identify stimuli for which the primary 
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those from Figs. 2 and 3. 


odor character was not due to the nominal 
compound. We selected the 50 stimuli to rep- 
resent three quadrants of intrapanel agree- 
ment and model-panel agreement (high-high, 
high-low, and low-high), anticipating that odor- 
ous contaminants may explain cases of poor 
model-panel agreement. Our QC procedure led 
to diverse conclusions: The nominal compound 
caused the odor (11/50), contaminants con- 
tributed to the odor in a minor (15/50) or major 
(5/50) way, contaminants caused the odor 
(15/50), or the cause of the odor could not be 
determined (4/50) (Fig. 3D). Fishy, garlic, and 
sulfurous were the most prevalent contami- 
nant odor qualities (fig. S17); these labels were 
overrepresented in the QC set, so we expected 
that the rate of severe contamination is likely 
lower in the full test set than in the QC set. In 
some cases, although we purchased a novel 
odorant, the dominant odorant was not novel; 
for example, the stimulus 4,5-dimethyl-1,3-thiazol- 
2-amine was described by the panel as buttery, 
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(C) Diagram of the psychophysical task in which panelists rated explicit 
perceptual distances between molecules in triplets. (D) Experimentally measured 
explicit perceptual distance ratings in the same triplets also show high discordance 
with structural distance, that is, the molecule that is more structurally similar 
to the anchor is usually (90% of the time) less perceptually similar. (E) The 
GNN model-predicted labels agree with the counterintuitive-but-correct perceptual 
relationship 50% of the time, that is, they correctly predict the empirical 
discordance half of the time, as measured by the cosine distance of the 
predicted, binarized labels. (F) A baseline model correctly predicts the empirical 
discordance only 19% of the time. The models in (E) and (F) are the same as 


sweet, and dairy, but this odor percept was at- 
tributed through QC to the contaminant diacetyl, 


a well-known buttery odorant. In another case, the - 


purchased odorant, isobornyl methylacrylate, 
was described by the panel and the model as 
both piney and floral; however, the nominal 
compound was floral only and the piney aroma 
was due to the closely related compound, bor- 
neol, which was detected as a contaminant in 
the sample. Based on QC results, we removed 
32 molecules known or suspected to have high 
degrees of odorous contamination (data S1). 
There were various QC results, each with dis- 
tinct implications on model performance (data 
S1). In some cases, the model performed well de- 
spite the presence of odorous contaminants. We 
estimate that if these contaminants were re- 
moved from the rated samples, model perform- 
ance would improve in 6 of 50 scenarios, would 
degrade in another 6 of 50 scenarios, would re- 
main neutral in 21 of 50 scenarios, and would be 
indeterminate in 17 of 50 scenarios. We estimate 
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Fig. 5. The POM solves a fundamental set of olfactory prediction tasks. (A) A two-dimensional trimap 
embedding of 500,000 unique likely odorants that were previously uncharacterized. The position of each point 
(molecule) is determined by POM coordinates, and the RGB values of each point correspond to their coordinates 
in the first three dimensions of a non-negative matrix factorization of the predicted odor labels. The inset plot 
shows the known odorants from the GS-LF training set (~5000) in color superimposed over the likely odorants in 
gray. (B) Intuitive geometric measures like vector length, vector distance, and vector projection correspond to the 
odor prediction tasks of odor detectability, similarity, and descriptor applicability. The equation shows that the 
projected space Y represents the dot product between POM and a task-specific projection matrix X. (€) A linear 
model using POM coordinates outperforms a chemoinformatic SVM baseline at predicting odor applicability 

on two extant datasets, provided by Dravnieks (35) and the DREAM Olfaction Prediction Consortium (6), as well as 
the current data. Error bars represent the standard error of the mean over descriptors. (D) A linear model 
using POM coordinates outperforms a chemoinformatic SVM baseline at predicting an odor detection 
threshold using data from Abraham et al. (36). (E) A linear model using POM coordinates outperforms a 
chemoinformatic SVM baseline at predicting perceptual similarity using data from Snitz et al. (4). 


the overall rate of significant odorous contam- 
ination in our stimulus set at 31.5% (95% con- 
fidence interval: 27.4 to 35.6%) (fig. S18). 


designed an additional challenge in which 41 
new triplets (Fig. 4, A and B) were constructed 
and validated by the panel (as in Figs. 1A and 
4C). In each triplet, the anchor molecule was 
a known odorant and was matched with one 
structurally similar and one structurally dissim- 
ilar novel odorant, where the more structurally 


The POM generalizes to diverse olfactory tasks 


To test whether the POM was robust to dis- 
continuities in structure-odor distances, we 
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dissimilar odorant was the more perceptually 
similar of the two to the anchor. To visualize 
the internal logic of the model, we made small 
changes to each node in a molecule and ob- 
served which perturbations had large effects 
on model predictions; perturbations in nodes 
with a darker red highlight had a larger impact 
(Fig. 4A and fig. S19). Explicit similarity ratings 
agreed with odor profile distances in 90% of 
the triplets (Fig. 4D). The model correctly pre- 
dicted this counterintuitive structure-odor rela- 
tionship in 50% of cases (Fig. 4E), whereas the 
RF model failed in 81% of cases (binomial test 
of proportions, p < 0.01; Fig. 4F). 

A reliable structure-odor map allows us to 
explore odor space at scale. We compiled a list of 
~500,000 potential odorants whose empirical 
properties are presently unknown to science 
or industry; most have never been synthesized 


before. Because a molecule’s coordinates in . 


the POM are directly computable from the 
model, we can plot these potential odorants 
in the POM (Fig. 5A), revealing a potential 
space of odorous molecules that is much larger 
than the space covered by current fragrance 
catalogs (~5000 purchasable, characterized 
odorants; inset of Fig. 5A). These molecules 
would take ~70 person-years of continuous 
smelling time to collect using our trained hu- 
man panel. 

We show that the POM has a meaningful 
interpretation by extracting intuitive, geomet- 
ric measures and mapping them to several 
olfactory prediction tasks (Fig. 5B). The appli- 
cability of any set of odor descriptors corre- 
sponds to a projection of the POM coordinates 
onto axes corresponding to those descriptors; 
odor strength (detectability) corresponds to 
the magnitude of this projection (fig. S13), and 
odor similarity corresponds to the distance 
between such projections for different mole- 
cules. A simple linear model applied to the 
POM that uses these geometric interpretations 
had comparable or superior performance to 
a chemoinformatic support vector machine 
(SVM) model across multiple published data- 
sets (Fig. 5, C to E), collectively representing 
some of the most thorough previous public 
efforts to characterize these features of odor. 


Discussion 


There is no universally accepted method for 
quantifying and categorizing an odor percept. 
Several systems of odor classification have been 
proposed: first, intuitive categorizations (26); 
then empirically supported universal spaces 
(27, 28); and later, attempts to incorporate 
receptor mechanisms (29, 30). However, these 
systems do not tie stimulus properties to percep- 
tion, and none have reached broad acceptance. 
Here, we propose and validate a data-driven, 
high-dimensional map of human olfaction. This 
map recapitulates the structure and relation- 
ships of odor perceptual categories evoked by 
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single molecules. It achieves prospective pre- 
dictive accuracy in odor description that ex- 
ceeds that of the typical individual human, and 
it is broadly transferrable to arbitrary olfactory 
perceptual tasks using natural and interpret- 
able transformations. 

Nearly all of the published chemosensory 
models were fit to the data used in their con- 
struction. Even when using cross-validation, 
the opportunity for overfitting is high because 
the data come from a single distribution, task, 
or experimental source. Prospective validation 
on new data from a new source with no adjust- 
ments represents a much more stringent test of 
real-world utility. In this prospective context, 
we found that our model performs roughly on 
par with the median human panelist, beating a 
chemoinformatic baseline. However, in a real- 
world setting, models can and should be up- 
dated as new data become available (37). A linear 
model using the POM coordinates reaches an 
even higher level of performance when the POM 
is tuned to the new dataset (Fig. 5C). 

The success of this model is not merely an 
advance in predictive modeling. It offers a sim- 
ple, contiguous, hierarchical, parseable map of 
molecular space in terms of odor, much as color 
spaces represent wavelengths of light in terms 
of colors and color components. This model not 
only enables human-level performance for odor 
description but also generalizes to a gamut of 
other olfactory tasks. It offers the opportunity 
to reason, intuitively and computationally, about 
the relationships within and between molec- 
ular and odor spaces. Unlike well-known color 
spaces, this model does not provide clear guid- 
ance about how stimuli can be mixed to produce 
new percepts nor does it use a biologically plau- 
sible architecture. Its closest analogy in vision 
is the Munsell color system, which is a princi- 
pled way to describe a color in terms of coor- 
dinates but lacks any specific guarantees about 
mixture behavior. Nonetheless, the Munsell 
system (and we hope the POM) is still consid- 
ered to be a useful representation of sensory 
perception. Further work can aim for a CIE- 
like representation of odor—one that specifi- 
cally predicts what odors can be made from 
mixing specified components. 

There are some practical considerations to 
keep in mind when using this map. First, the 
concentration of an odor influences odor char- 
acter but is not explicitly included in the map. 
Although it can predict a detection threshold, 
which is a property of the odorant molecule, it 
cannot predict suprathreshold intensity, which 
is a function of the odorant and its concentra- 
tion. Many molecules have no odor, which we 
addressed by prescreening with a separate, 
simpler model (32), and we diluted odorants 
to standardize intensity. Second, predictive 
performance is strong for organic molecules, 
which are most of the odorants that we en- 
counter, but we could not extend the predic- 
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tions into halides or molecules that include 
functional groups without extensive safety 
data. Given the uniformly strong performance 
across the broad chemical classes that we 
tested in our prospective validation set (Fig. 3C), 
we expect high accuracy on novel chemicals 
within these chemical classes, but we would 
not expect high performance for molecules 
that have chemical motifs that were not rep- 
resented in our training set. For instance, if 
our training dataset did not contain any mol- 
ecules with carbon macrocycles, we would not 
expect the model to accurately predict the odor 
of an unseen macrocyclic musk (Fig. 3A). Third, 
many chemical stimuli have odorous contam- 
inants (25), particularly those that have not 
been developed for use in fragrance applica- 
tions. Neural networks perform well, even with 
substantial noise in the training and test sets, 
which we see in the present work. Nonethe- 
less, we recommend isolating the compound of 
interest from odorous contaminants and/or 
characterizing the perceptual quality of con- 
taminants. Fourth, the model was designed to 
predict the population average, much as color 
maps predict average perception. It does not 
yet account for individual differences in per- 
ception. Finally, datasets in real-world settings 
are not static but grow and shift in distribution; 
thus, models should be periodically retrained 
to incorporate new data. Model performance 
tends to improve with increased training data 
(Fig. 3B) and data quality (fig. S15), consistent 
with machine-learning applications in other 
areas (33, 34). 

Progress in neuroscience is often measured 
by the creation and discovery of new maps of 
the world supported by neural circuitry—maps 
of space in hippocampus, tonotopy in auditory 
cortex, and retinotopy and Gabor filters in V1 
visual cortex, among others. Each is only pos- 
sible because scientists first possessed a map 
of the external world and then measured how 
responses in the brain varied with stimulus 
position on the map. This study proposes and 
validates a data-driven map of human olfac- 
tion. We hope this map will be useful to re- 
searchers in chemistry, olfactory neuroscience, 
and psychophysics—first, as a drop-in replace- 
ment for chemoinformatic descriptors and, 
more broadly, as a new tool for investigating 
the nature of olfactory sensation. 
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Cell polarity linked to gravity sensing is generated 
by LZY translocation from statoliths to the 


plasma membrane 


Takeshi Nishimura’?+, Shogo Mori!+, Hiromasa Shikata’?+, Moritaka Nakamura’, 
Yasuko Hashiguchi*+, Yoshinori Abe*, Takuma Hagihara’, Hiroshi Y. Yoshikawa’, 


Masatsugu Toyota*®7, Takumi Higaki®®, Miyo Terao Morita 


1,2. 


Organisms have evolved under gravitational force, and many sense the direction of gravity by means 
of statoliths in specialized cells. In flowering plants, starch-accumulating plastids, known as amyloplasts, 
act as statoliths to facilitate downstream gravitropism. The gravity-sensing mechanism has long 

been considered a mechanosensing process by which amyloplasts transmit forces to intracellular 
structures, but the molecular mechanism underlying this has not been elucidated. We show here that 
LAZY1-LIKE (LZY) family proteins involved in statocyte gravity signaling associate with amyloplasts 
and the proximal plasma membrane. This results in polar localization according to the direction of 
gravity. We propose a gravity-sensing mechanism by which LZY translocation to the plasma membrane 
signals the direction of gravity by transmitting information on the position of amyloplasts. 


lants sense their inclination relative to 

the direction of gravity and reorient their 

growth direction accordingly, a process 

called gravitropism (J, 2). The directional 

change is sensed mainly by specialized 
cells called statocytes. Statocytes contain amylo- 
plasts, which are plastids filled with dense starch 
granules (3, 4). Amyloplasts act as statoliths 
within cells by displacing toward gravity. Amylo- 
plast displacement is believed to promote auxin 
transport in the direction of gravity, leading to 
organ bending (5). Several hypotheses have been 
proposed regarding the sensing mechanism by 
which the physical process of amyloplast displace- 
ment generates biochemical signals in stato- 
cytes, including the force-sensing model (6) and 
position sensor hypothesis (7). However, the mo- 
lecular mechanisms of gravity sensing and sig- 
naling remain largely unknown. 

LAZY1-LIKE (LZY) family genes, which are 
conserved in most land plants, are involved 
in gravitropism and whole-plant architecture 
through the control of branch angles in various 
species in angiosperms, including trees (8-15). 
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In Arabidopsis, LZY proteins have been shown 
to play critical roles in the signaling process 
controlling directional auxin flow after amylo- 
plast displacement in statocytes in both shoots 
and roots (16). Although there are no known 
functional domains in LZY proteins, five con- 
served regions with 10 to 20 amino acids were 
found (fig. S1A) (2, 10). Mutational analyses 
have demonstrated the functional importance 
of regions I, II (also known as IGT motif) (15, 17), 
and V (CCL region) (16, 18). LZY1, rice LAZY1, 
and maize ZmLA1 localize to the plasma mem- 
brane (PM) and the nucleus in ectopic or tran- 
sient expression systems. As a result, possible 
functions in the nucleus in addition to the PM 
have been suggested (14, 19, 20). It should be 
noted that, under the control of its own pro- 
moter, LZY3 is polarly localized to the PM in 
the direction of gravity and repolarizes in re- 
sponse to the reorientation in statocytes in the 
columella cells of lateral roots (21). This find- 
ing prompted us to investigate the mechanism 
of LZY3 polarization to understand gravity sens- 
ing and signaling in gravitropism. 


Regions for functional regulation of LZY3 


We determined the regions responsible for mem- 
brane association in the LZY3 protein, which 
lacked detectable transmembrane domains (by 
InterProScan) (22). Basic-hydrophobic clusters 
in proteins can facilitate their membrane asso- 
ciation in eukaryotes, including plants (23-25). 
A computational search indicated that all LZY 
proteins in Arabidopsis have more than two 
regions with a high basic-hydrophobic cluster 
score (20.6) that have the potential to serve 
as unstructured membrane-binding sites (fig. 
S1, A and B) (23). Glutamine (Gln) substitu- 
tions were introduced at two sites, designated 
sites A and B, in LZY3 that decreased the basic- 
hydrophobic cluster scores of the corresponding 
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mary roots of /zy2;3;4 triple mutants tende—.2 
grow upward (2), the expression of LZY3 fused 
with mCherry (LZY3-mCherry) harboring A-7Q, 
A-10Q, and B-7Q substitutions, as well as wild 
type (WT), rescued the primary root angle 
phenotypes of lzy2;3;4 under the control of the 
LZY3 promoter (Fig. 1B and fig. S1, D and E). 
Plants expressing LZY3 B-11Q exhibited varia- 
ble growth directions (Fig. 1B and fig. S1D). 
The LZY3-mCherry signals of WT, A-7Q, and 
B-7Q were faint in the living columella cells of 
the primary roots. In contrast, the signals of 
A-10Q and B-11Q exhibited observably in- 
creased fluorescence to different extents (Fig. 
1C and fig. S1, C and G). The fluorescence sig- 
nals of A-10Q accumulated at the lower side 
of the PM, whereas the signals of B-11Q were 
mainly detected in the cytosol (Fig. 1, C and D). 
Combination of B-7Q with A-7Q or A-10Q in- 
creased the LZY3-mCherry levels and the rel- 
ative fluorescence intensities in the cytosol, 
which is consistent with their lower degree of 
phenotypic rescue in gravitropism (Fig. 1, C 
and D, and fig. S1, C, F, and G). LZY3-mCherry 
that has both A-10Q and B-11Q was mostly 
found in the cytosol, as was the version with ‘ 
only B-11Q (Fig. 1C). These findings suggest 
that sites B and A, including their surrounding 
sequences, play major and minor roles, re- 
spectively, in PM association of LZY3, possibly ‘ 
through electrostatic interactions with anionic 
membrane lipids, and that both regions help 
maintain LZY3 at low levels (fig. SIG). 

A putative PEST motif (i.e., a motif enriched 
with Pro, Glu, Ser, and Thr), which may facil- 
itate protein degradation via the proteasome 
pathway, is found in the vicinity of site A, im- 
plying that it may be involved in the regulation 
of LZY3 levels (fig. S2A) (26, 27). The predicted 
PEST score of A-10Q is reduced compared with 
wild-type LZY3, whereas that in A-7Q is not ‘ 
affected (fig. S2A). As expected, LZY3APEST- 
mCherry, which lacks the PEST motif without 
impairing site A itself, displayed similarities to 
A-10Q in terms of root growth angle, localiza- 
tion pattern within columella cells, and accu- 
mulation level within those cells (fig. $2, B to 
D). These findings suggest that the accumula- 
tion of LZY3-mCherry on the PM in A-10Q lines 
resulted from impairment of the PEST motif, 
thereby disturbing quantitative regulation of 
LZY3 on the PM through proteasomal protein 
degradation. The addition of a farnesylation site 
[K8-far (28)] at the C termini of LZY3 mutant 
proteins B-11Q and A-10Q;B-11Q overrode the 
effect of their mutation on localization and ac- 
tivity (fig. $3). These results suggest that the 
B-11Q and A-10Q;B-11Q proteins retain all func- 
tions of the wild-type LZY3 with the exception 
of association with the PM. In addition, when 
K8-far was present, the level of the B-11Q protein 
was reduced, implying a link between the mem- 
brane localization and quantitative regulation 
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Fig. 1. A putative membrane association site in LZY3 is required for subcellular localization in the 
columella cells and for root gravitropism. (A) Putative PM association sites in LZY3. The lines indicate 
sites with basic-hydrophobic scores = 0.6 according to computational prediction. In A-7Q and A-10Q, 

7 and 10 basic amino acid residues at and near site A, respectively, were replaced with glutamine 
residues. In B-7Q and B-11Q, 7 and 11 basic residues at and nearby site B, respectively, were replaced 

with glutamine residues. The basic residues (K and R) and glutamine residue (Q) are colored magenta 

and green, respectively. Single-letter abbreviations for the amino acid residues are as follows: A, Ala; 

C, Cys; D, Asp; E, Glu; F, Phe; G, Gly; H, His; I, lle; K, Lys; L, Leu; M, Met; N, Asn; P, Pro; Q, Gln; R, Arg; S, Ser; 
T, Thr; V, Val; W, Trp; and Y, Tyr. (B) Absolute values of growth angle (0) of the primary root tips of 
5-day-old seedlings of wild-type, Izy2;4, Izy2;3;4, and transgenic lines expressing mCherry-tagged LZY3 
mutant proteins under the control of the LZY3 promoter (LZY3p::LZY3-mCherry) in the Izy2;3;4 mutant 
background. The direction of each root tip was measured as the absolute value toward the direction of gravity 
(g). The frequency was calculated as the proportion of root numbers that fell within intervals of 15° to 

the total number of analyzed roots for each line (range: 0° to 180°). Asterisks indicate significant differences 
P < 0.0033 (0.05/15)] by Wilcoxon rank sum test with Bonferroni correction (* compared with Izy2;4; 

** compared with Izy2;3;4). (C) Representative confocal images of wild-type, A-10Q, B-11Q, and A-10Q;B-11Q 
mutants of LZY3-mCherry in columella cells of primary roots. The area containing tier 1 to 3 columella 
cells of the representative lines is shown. The seedlings were kept in a vertical position before and during 
imaging. The roots of the B-11Q and A-10Q;B-11Q lines were straightened at least 3 hours before imaging 

to make the amyloplasts sediment toward the root tip, as those roots were meandering (fig. S1D). All images 
were acquired with an identical optical setting, and the color-coded heatmaps for signal intensities appear 
below each image. Scale bars, 20 um. QC, quiescent center. (D) Ratio of fluorescence intensities of 
LZY3-mCherry at the PM to that of the whole cell. The ratios were calculated with mean values within a 
3-pixel-width line drawn on the PM region and within the area surrounded by the line. For each line, 

9 to 41 cells with clear cell edges in 5 to 8 roots were measured. Wild-type, A-7Q, and B-7Q lines were 
not analyzed because of low protein accumulation. 
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of LZY3. Taken together, these findings indi- 
cate that the PM localization of LZY3 is re- 
quired for its function in gravity signaling. 


LZYs associate with amyloplasts and the 
nearby PM 


Meanwhile, we noticed faint ring-shaped fluo- 
rescence signals in the tier 3 columella cells 
of wild-type LZY3-mCherry, which were ob- 
served at the position of amyloplasts (Fig. 1C). 
Given that LZY localizes to amyloplasts, LZY 
might help link gravity sensing to the down- 
stream signaling. However, the weak fluorescence 
intensity of LZY3-mCherry made it difficult to 
analyze the behavior of LZY3 in tier 2 cells, 
which are the major contributors to gravitropism 
(29). Therefore, we examined whether LZY4, 
which redundantly functions in root gravi- 
tropism (30, 37), can be applied for live-cell im- 
aging analysis. There are four splicing variants _ 
of LZY4 (annotated in the TAIR1O genome), 
three of which lack the CCL region (fig. S4, A and 
B), which is important for LZY1/2/3 function 
(6, 18, 21). We demonstrated that LZY4.4 with 
a CCL-like sequence is the functional form in 
gravitropism (fig. S4C). The CCL-like sequence of 
LZY4.4 was capable of interacting with the 
BREVIS RADIX (BRX) domain of RCC1-like do- 
main proteins (RLDs) in the yeast two-hybrid 
system (fig. S4D). RLD1 is an interacting part- 
ner of LZY in the regulation of auxin transport 
through membrane trafficking, and it is re- 
cruited to the PM in a LZY2/3-dependent man- 
ner (21, 32). LZY4.4 recruited RLD1 to the PM in 
Arabidopsis protoplasts (fig. $5). In addition, 
LZY4.4-mClover3 driven by the statocyte-specific 
ADF9 promoter rescued the primary root angle 
phenotype of lzy1;2;3;4 (fig. S6). These results 
indicate that the molecular function of LZY4.4 
(hereafter referred to as LZY4) is almost equiv- 
alent to that of LZY3 in root statocytes. 
Ring-shaped and linear fluorescence signals 
from functional LZY4p::LZY4-mScarlet were 
observed in the tier 2 columella cells of both 
young lateral and primary roots (Fig. 2, A and 
B, and fig. S7A). The linear fluorescence signal 
merged with that from the PM-staining re- 
agent (fig. S8), which confirms the PM local- 
ization of LZY4. LZY4-mScarlet on the PM 
was observed only proximal to amyloplasts, 
implying polarity in the direction of gravity 
(Fig. 2A). LZY4 also has a basic amino acid- 
rich region in the middle of the protein (basic- 
hydrophobic cluster score = 0.6; figs. SLA and 
S9A). LZY4-mScarlet carrying the B-6Q muta- 
tion in this region exhibited lower fluorescence 
at the PM, lack of polarity, and weaker capac- 
ity for phenotypic rescue, suggesting a PM- 
binding mechanism and function similar to 
LZY3 (Fig. 2, B and C, and figs. S7B and S9B). To 
investigate the association of the ring-shaped 
signal of LZY4-mScarlet with amyloplasts, we 
conducted observations using green fluorescent 
protein-fused PERMEASE IN CHLOROPLASTS1 
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(PIC1-GFP) and OUTER ENVELOPE MEM- 
BRANE PROTEIN 7 (OEP7-GFP), both of which 
serve as plastid markers localized at an inner 
and outer envelope of plastids, respectively. 
These observations were performed on both 
living and fixed lateral root columella cells. 
LZY4-mScarlet was observed in close proxim- 
ity to or partially overlapping with PICI-GFP 
and OEP7-GFP, indicating that LZY4 likely lo- 
calizes to the amyloplast envelopes or their im- 
mediate vicinity (Fig. 2D and fig. S10). However, 
defining the exact localization of LZY4 in amylo- 
plasts remains challenging because of the weak 
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expression levels of LZY4-mScarlet and the en- 
velope markers, as well as technical limitations. 

Next, we generated LZY4-mScarlet lacking 
the N-terminal domain I (LZY4-AI-mScarlet) 
and found that the protein was present at the 
PM and in the cytosol, but the ring-shaped sig- 
nal was hardly detected (Fig. 2E and fig. S7C). 
In addition, LZY4-AI-mScarlet was not fully 
functional (Fig. 2B). Consistently, mCherry fused 
with the N-terminal 54 amino acids of LZY3, 
including the conserved domain I, displayed ob- 
vious ring-shaped signals (fig. S11). These find- 
ings suggest that the N-terminal region including 
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Fig. 2. Amyloplast and basal plasma membrane localization of LZY4 in the columella cells. (A) Subcellular 


mutant background. Scale bars, 10 um. 
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ocalization of LZY4-mScarlet (#2)/lzy4 in lateral root columella cells. Arrows in the right panel indicate 
the amyloplast-like ring shape localization of LZY4-mScarlet, and arrowheads indicate the localization to the 
basal plasma membrane. BF, bright field. Scale bars, 10 um. (B) Complementation test of LZY4-mScarlet/ 
Izy2;3;4, LZY4(B-6Q)-mScarlet/Izy2;3;4, and LZY4-Al-mScarlet/Izy2;3;4 in the direction of the primary root 
tips of 5-day-old seedlings. Asterisks indicate significant differences [P < 0.0045 (0.05/11)] by Wilcoxon 
ank sum test with Bonferroni correction (*% compared with Izy23; ** compared with Izy2;3;4). (©) Subcellular 
localization of LZY4(B-6Q)-mScarlet (#7)/Izy2:3.4 in lateral root columella cells. Scale bars, 10 um. (D) Localization 
of LZY4-mScarlet and PIC1-GFP in amyloplasts. Scale bar, 10 um. (E) Subcellular localization of LZY4-Al- 
mScarlet (#12)/Izy2;3;4. Scale bars, 10 wm. (F) Subcellular localization of LZY4-mScarlet (#2) in the Izy4 arl2 
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domain I is involved in the amyloplast localiza- 
tion of LZY3/4. 

Several mutations in genes encoding com- 
ponents of the translocon at the outer chloro- 
plast membrane (TOC) complex enhance the 
phenotype of altered response to gravity 1 (argl) 
and its paralog argi-like2 (arl2) (33, 34). ARGI 
and ARL2, encoding J-domain proteins, are in- 
volved in gravity signaling in statocytes (35-37) 
similarly to LZYs. We investigated the possible 
involvement of ARG1 and ARL2 in the amylo- 
plast localization of LZY4-mScarlet. In the arg 
and arl2 background, the LZY4-mScarlet signal 
became undetectable in amyloplasts and at the 
PM, and the signal was dispersed in the cyto- 
sol (Fig. 2F and figs. S7, D and E, and S12). ARG1 
and ARL2 may be involved in the recruitment 
of LZYs to amyloplasts, possibly through their 
chaperone activity. We cannot rule out the 
possibility that ARGI and ARL2 might also be . 
involved in the PM localization of LZYs, be- 
cause ARL2 is reported to localize at the PM 
(36). Considering that precursor proteins tar- 
geted to plastids interact with TOC through 
their N-terminal transit peptides with the help 
of molecular chaperones (38), mechanisms 
similar to plastid protein targeting could be in- 
volved in targeting LZYs to amyloplasts. Fur- 
ther investigations into the precise localization 
of LZYs in amyloplasts and their targeting 
mechanisms are warranted to gain a deeper 
understanding of the LZY-mediated interac- 
tion between the amyloplast and PM. 


LZY4 polarity rapidly changes 
upon gravistimulation 


Next, we examined the effect of gravistimulation 
by reorientation of plants on the localization 
of LZY4-mScarlet. We used live-cell imaging of 
lateral root columella cells with a vertical stage 
confocal microscope (39). Before gravistimu- 
lation, polar localization of LZY4-mScarlet on 
the PM was maintained throughout the ob- 
servation, although amyloplasts were slightly 


agitated (fig. S13B and movie S1). After gravi- - 


stimulation, LZY4-mScarlet appeared on the 
PM of the new lower side of the cells at almost 
the same time as amyloplast displacement, and 
the signal gradually became stronger (Fig. 3, A 
and B; fig. S13C; and movie $2). Simultaneously, 
the LZY4-mScarlet signal on the PM in the orig- 
inal direction of gravity gradually disappeared, 
resulting in the generation of the new polarity 
of LZY4. Such repolarization on the PM was not 
observed with LZY4(B-6Q)-mScarlet (fig. S14 and 
movie S3). The starchless phosphoglucomutase 
(pgm) mutant has amyloplasts that hardly sedi- 
ment, because of the lack of dense starch gran- 
ules, and it exhibits reduced gravitropism (40-42). 
In the pgm mutant, LZY4-mScarlet localized to 
smaller starchless amyloplasts and to the PM. 
LZY4-mScarlet failed to accumulate polarly on 
the PM in the pgm mutant, probably because 
of altered movement of amyloplasts (Fig. 3C, 
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Fig. 3. Polar localization of LZY4 changes in 
response to gravistimulation. (A) Gravistimulation and 
the site of observation. Stage 2 lateral roots (<3 mm) 
were used. (B and C) Representative confocal images 
of the columella cells of LZY4p::LZY4-mScarlet in the Izy4 
(B) or Izy4 pgm-1 (C) background immediately before 
(0 min) or 15 min after gravistimulation. The seedlings 
were kept in a vertical position before and during imaging. 
mScarlet was excited at 561 nm, and the fluorescence 
that passed through a 617/73 nm emission filter was 
detected. Scale bars, 25 um. (D) Temporal changes 
in the localization of LZY4-mScarlet on the PM along 
the direction of gravity. Images were taken at 20-s 
intervals for 15 min. Eight cells from five of the nonrotated 
lateral roots and 17 cells from 11 of the 135°-rotated 
ateral roots were used for quantification. Error bars 
indicate 95% confidence intervals. a.u., arbitrary units. 


fig. S15, and movie S4), indicating that amylo- 
plast sedimentation is required for the polar- 
ization of LZY4 on the PM. 

We quantified the temporal change in the lo- 
calization of the LZY4-mScarlet signal at the PM 
along the direction of gravity (Fig. 3D and fig. 
S16). Briefly, to focus on the change in the fluo- 
rescence intensities on the PM along the vertical 
line (y axis), the intensities on the PM were av- 
eraged along the axis (fig. S16, A to H). Then, 
each averaged value along the y axis was multi- 
plied by a value (between 1 and 100) assigned to 
the same position on the y axis, which is linearly 
distributed from the bottom to the top of the 
intensity profile (fig. S16, I and J). The calculated 
values were integrated for each time frame (fig. 
S16K), and this weighting process was designed to 
produce progressively lower integrated values 
over time as LZY4-mScarlet moves toward the 
direction of gravity. Thus, a higher value of 
weighted intensity means that LZY4-mScarlet 
polarizes at a higher position on the PM, whereas 
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Fig. 4. Translocation of Lzv4 from the aid to the plasma membrane. (A and B) Observation of the 
translocation of LZY4-mEos2 from the amyloplasts to the PM by the photoconversion method (A). Immediately after 


photoconversion, there was almost no red fluorescence signal on the PM (white arrowhead at O min). At 1 min after 
photoconversion, a linear red fluorescence signal was observed at the PM in close proximity to the amyloplasts 
(white arrowhead at 1 min). Scale bar, 10 um. There was a significant difference between the findings at O and 1 min 
(Wilcoxon rank sum test, P < 0.05). The increase in the red fluorescence signal on the PM at 1 min after 
photoconversion was quantitatively analyzed and plotted as boxplots. Center lines show the medians; box limits 
indicate the 25th and 75th; whiskers extend 15 times the interquartile range from the 25th and 75th percentiles; 
outliers are represented by dots (B). The plots represent the results of eight independent experiments. See 
also fig. S21B, showing images of mEos2 green fluorescence signals. (© and D) Temporal changes in the 
ocalization of LZY4-mScarlet upon manipulating amyloplasts with an optical tweezer. Amyloplasts were fixed 


with the optical tweezer and positioned near the PM (yellow arrowheads) by moving the roots immobilized on 
the stage. The representative change is shown in (C). Scale bar, 5 um. The temporal changes in the signal of 
mScarlet on the PM where the amyloplasts were positioned are quantitatively shown in (D). The plot represents 
the mean + standard error of three independent experiments. 


a lower value means that LZY4-mScarlet polar- 
izes in the direction of gravity. Without gravi- 
stimulation, the weighted intensity gradually and 
slightly decreased, probably because of quench- 
ing during the 15-min observation. In contrast, 
weighted intensity decreased after gravistimula- 
tion, indicating repolarization of LZY4-mScarlet 
in the direction of gravity. An incremental de- 
crease in weighted intensity occurred immedi- 
ately after gravistimulation, presumably linked 
with amyloplast displacement. 

To clarify the temporal relationship between 
the repolarization of LZY4 and other processes 
in gravitropism, we examined LZY4-mScarlet 
repolarization, auxin distribution, and organ 
curvature with primary roots under as sim- 
ilar conditions as possible (fig. S17). The be- 
havior of LZY4-mScarlet in primary roots was 
almost the same as that in lateral roots. Sig- 
nificant repolarization of LZY4-mScarlet was 
detected within 15 min after gravistimulation 
(fig. S17, A to C, and movie S5). The asymmetric 
distribution of auxin, detected using the ratio- 
metric auxin biosensor R2D2 (43), appeared 
16 min after gravistimulation, and it became sig- 
nificant at 30 min (fig. S17, D and E). Significant 
root curvature appeared 30 min after gravi- 
stimulation (fig. S17F). In addition, amyloplast 
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displacement occurred 3 min after gravistim- 
ulation in our observation system, as previously 
reported (6). Polar localization of RLD1-GFP 
merged with LZY4-mScarlet was observed at 
15 min after gravistimulation (fig. S18). There- 
fore, repolarization of LZY4-mScarlet is chrono- 
logically consistent with a role in linking gravity 
sensing to signaling. Although we tested relo- 
cation of PIN3-GFP upon gravistimulation ac- 
cording to the measurement method reported 
previously (21, 44), obvious relocation was not de- 
tected within 15 min (fig. S19). As reported previ- 
ously, the polar pattern of PIN3-GFP in the root 
cap columella after gravistimulation was observed 
300 min after gravistimulation in the stage 2 lat- 
eral root in our experimental conditions (27). This 
implies that the LZY-RLD module may not direct- 
ly regulate the trafficking of PIN3 but rather reg- 
ulates other regulatory factors for auxin transport. 


Amyloplast position determines LZY4 polarity 
at the PM 


Given that LZY interacts electrostatically with 
negatively charged lipids, such lipids might 
polarly distribute on the PM and contribute to 
the polar localization of LZYs. Therefore, we 
examined the distribution of several nega- 
tively charged lipids in the statocytes of primary 
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and lateral roots using fluorescent biosen- 
sors (fig. S20). Only the phosphatidylinositol- 
4,5-bisphosphate biosensor CITRINE-2xPH?“ 
(45) tended to be polarly distributed toward the 
root tip. However, clear repolarization of CITRINE- 
2xPH?““ was not detected upon gravistimula- 
tion in the examined time window. Therefore, 
the distribution of these lipids is unlikely to be 
related to the polarity of LZYs. 

Next, to elucidate the relationship between 
LZY4 residing in amyloplasts and its polariza- 
tion at the PM, the behavior of LZY4 was ana- 
lyzed using the LZY4p::LZY4-mEos2/lzy1;2;3;4 
rescued line (fig. S21A). LZY4-mEos2 on the 
amyloplasts was photoconverted from green 
to red fluorescence by irradiation with a 405-nm 
laser on a vertical stage confocal microscope. 
The red fluorescence signal was observed only 
in amyloplasts immediately after irradiation, 
and a linear pattern emerged on the nearby PMs 
(Fig. 4A; figs. S21, B and C, and $22; and movie 
S6). The fluorescence intensity on the PM prox- 
imal to amyloplasts significantly increased 
1 min after irradiation (Fig. 4B). Although we 
could not dismiss the possibility that a trace 
amount of cytoplasmic LZY4-mEos2 was photo- 
converted and accumulated at the PM, this re- 
sult suggests that LZY4 is rapidly translocated 
from amyloplasts to the PM. It is expected that 
the position of amyloplasts determines the LZY4 
accumulation site on the PM independent of 
the direction of gravity. To test this possibility, 
amyloplasts were manipulated by an optical 
tweezer during observation by confocal laser 
scanning microscopy (46). After trapping some 
amyloplasts, the amyloplasts were manipulated 
into close proximity of the PM (figs. S23 and S24 
and movies S7 and S8). The fluorescence inten- 
sity of LZY4-mScarlet significantly increased at 
the PM region where the amyloplasts were new- 
ly placed nearby (Fig. 4, C and D; fig. S23, G to L; 
and movie S9), indicating that amyloplasts act 
as determinants of LZY4 polarity on the PM. 

Gravity sensing in gravitropism has long been 
considered a mechanosensing mechanism 
whereby the weight of statoliths exerts a force 
on intracellular structures (6, 47). On the basis 
of the experimental results using angiosperm 
shoots, it has recently been suggested that the 
sensor functions as a clinometer but not as a 
force sensor (48). This supports the “position 
sensor hypothesis,” which proposes that prox- 
imity or contact between statoliths and the 
membrane induces local auxin fluxes (7) and 
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that statocytes act as clinometers (49), although 
there had been no known molecules support- 
ing this hypothesis. We revealed in this study 
that LZY exhibits behavior that fits the posi- 
tion sensor hypothesis well. LZY polarity on the 
PM formed according to the position of amylo- 
plasts by translocation of LZY from amyloplasts 
to the PM (fig. S25A). LZY appears to act as a 
signal molecule that transmits the positional 
information of statoliths to the PM, which di- 
rectly links gravity sensing to subsequent signal- 
ing processes. Polarly located LZY recruits RLD 
to promote polar auxin flow in the direction of 
gravity (fig. S25B) (27). 
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SILICONE CHEMISTRY 


Ring-opening polymerization of cyclic oligosiloxanes 
without producing cyclic oligomers 


Limiao Shi, Aurélie Boulégue-Mondiére”, Delphine Blanc”, Antoine Baceiredo’, 


Viceng Branchadell*, Tsuyoshi Kato™* 


A long-standing problem associated with silicone synthesis is contamination of the polymer products with 10 to 
15% cyclic oligosiloxanes that results from backbiting reactions at the polymer chain ends. This process, in 
competition with chain propagation through ring-opening polymerization (ROP) of cyclic monomers, was 
thought to be unavoidable and routinely leads to a thermodynamically controlled reaction mixture (polymer/ 
cyclic oligosiloxanes = 85/15). Here, we report that simple alcohol coordination to the anionic chain ends 
prevents the backbiting process and that a well-designed phosphonium cation acts as a self-quenching system 
in response to loss of coordinating alcohols to stop the reaction before the backbiting process begins. The 
combination of both effects allows a thermodynamically controlled ROP of the eight-membered siloxane 

ring D, without producing undesirable cyclic oligosiloxanes. 


ince the discovery of polysiloxanes, com- 

monly called silicones, in the first at- 

tempt to synthesize a stable silanone 

by F. Kipping in 1901 (J) and their first 

industrial production by Dow-Corning 
Corporation in 1943 (2), these organosilicon 
materials have found extremely wide-ranging 
applications in different forms (oil, rubber, gel, 
and resin) because of their particular chemical 
and physical properties (3). Despite important 
progress in their synthesis (4-6), a long-known 
but still unsolved drawback is their systematic 
contamination with cyclic oligosiloxanes (~10 to 
15%) regardless of the methods used (3, 4). Con- 
sidering the potential toxicity of low-molecular- 
weight cyclic siloxanes (7) capable of overcoming 
the human skin barrier (8), it is critical to ad- 
dress this contamination problem (9). Currently, 
nearly 50% of new skin care products contain 
at least one type of silicone (10). 

The problem arises from backbiting reac- 
tions at the polymer chain end to produce cy- 
clic oligosiloxanes of different sizes [four- to 
seven-membered rings (D4.7); Fig. 1A] (77, 12). 
The rate of backbiting is competitive with the 
propagation through ring-opening of cyclic 
monomers, leading to a backbiting/propaga- 
tion equilibrium and therefore to a polymer/ 
cyclic oligomer mixture in a thermodynami- 
cally controlled ratio (~85/15; Fig. 1B). This 
means that in the polymer chemistry of sili- 
cones, most of the reactions catalyzed by either 
acid or base end with a ring-opening polymer- 
ization (ROP) process as the final stage, leading 
to the same contamination (4). An exception 
is the ROP of hexamethylcyclotrisiloxane (D3) 
(13, 14), which allows a substantially faster pro- 
cess (100 to 1000 times faster than that of octa- 
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methylcyclotetrasiloxane D,) (72). In this case, 
with proper choice of initiator/catalyst and 
conditions, the ROP of Ds is kinetically con- 
trolled and achieves complete monomer con- 
version (Fig. 1C, dashed line) (15-21), whereas 
the anionic ROP of other monomers (D,.7) is 
under thermodynamic control, affording the 
conventional polymer/cyclic oligomer mixture 
(Fig. 1C, solid line) (72). Thus, the achievement 
of the ROP of D, (most thermodynamically fa- 
vored) without producing cyclic oligomers by 
somehow preventing the backbiting process 
and altering the thermodynamic polymer/cyclic 
oligomer ratio (from 85/15 to 100/0) (Fig. 1E) 
would be a fundamental solution for this issue. 
Here, we report that simple coordination of al- 
cohols to the active silanolate chain ends pre- 
vents the backbiting process (Fig. 1D), and that 
well-designed phosphonium counter-cations act 
as a smart self-quenching system in response 
to the loss of coordinating alcohols to stop the 
reaction before the backbiting process starts. 
The combination of these two effects allows the 
thermodynamically controlled anionic ROP of 
D, without producing undesirable cyclic oligo- 
siloxane contaminants (Fig. 1E). 

As mentioned previously, the origin of the 
problem, incomplete ROP of cyclic oligosilox- 
anes, is the competitive rate of backbiting rela- 
tive to propagation. Thus, we hypothesized that 
flexible coordination between weakly acidic 
alcohols and the anionic silanolate chain end 
should increase steric hindrance around the 
nucleophilic site and modulate the relative 
rates without excessively reducing the propa- 
gation rate (Fig. 1D). Nucleophilic attack of the 
anionic chain end on silicon centers of the 
linear siloxane chains (i.e., the backbiting pro- 
cess) could be significantly more hindered by 
this effect than attack on the sterically more 
accessible silicon atom of a rigid cyclic mono- 
mer for propagation process. On the basis of 
this hypothesis, we performed the ROP of Dy, 
in the presence of an excess of benzyl alcohol 
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q 


[5 equivalents (equiv.)] relative to tBuOK/ayp Se 
initiator (0.5 mol % versus D4) (21, 22) at 8.-=-— 
Careful monitoring of the ROP demonstrated 
that the polymer ratio to cyclic oligomers (D,.7) 
reached 95% after 6 hours and then gradually 
decreased to 89% in 12 hours. This result sug- 
gests that there is a certain effect preventing 
the backbiting reactions that gradually dis- 
appears over time. This disappearance can be 
attributed to the loss of coordinating alcohols 
at the anionic chain ends either by chain trans- 
fer or by dissociation at the last stage of poly- 
merization, leading to the conventional situation 
with the propagation/backbiting equilibrium. 
Probably for this reason, the effect of alcohol 
coordination has been ignored previously. 

A possible solution is an automatic quench 
system responsive to the loss of coordinating 
alcohols. For this purpose, we considered the 
use of (-propyl)tris(dialkylamino)-phosphonium 
alkoxides 1-xROH that are stable only with the 
alkoxide anion stabilized by coordination of al- 
cohols through weak hydrogen-bonding interac- 
tions (Fig. 2A) (23). Without a sufficient number 
of stabilizing alcohols, the alkoxide readily at- 
tacks the P atom of the phosphonium cation, 
which, through a pentacoordinate intermediate ‘ 
3, affords either amine 4 and phosphine oxide 
5 (wv = 0) or ether 6, amine 7, and phosphine 
oxide 5 (w = 1) (Fig. 2B) (24). The use of such 
phosphonium alkoxides 1-xXROH as the initia- * 
tor of ROP leads to the formation of silicones 
with alcohol-stabilized anionic silanolate chain 
ends that should behave in the same way. In 
other words, once coordinating alcohols are lost, 
the ionic pair (silanolate anion/phosphonium cat- 
ion) at the chain end start reacting to gener- 
ate a neutral chain end (Si-NR, or Si-OR), thus 
quenching the polymerization process. 

Alcohol-free and -stabilized phosphonium 
alkoxides 1-xBnOH were synthesized by the - 
addition of benzyl alcohol (BnOH) to phos- ‘ 
phonium ylides 2a-e with different dialkyla- 
mino substituents (Fig. 2, A and C) (24). The 
number of coordinating alcohols (@) can be 
precisely controlled by the quantity of BnOH 
(@ + 1 equiv.) added to the ylide 2. As ex- 
pected, they were thermally labile, and the 
least-hindered variant, la-4BnOH, fully de- 
composed within 1 hour (42 = 15 min) at 80°C 
in CgDg (M = 0.07 mol/liter) to generate the 
corresponding dibenzylether 6, dimethylamine 
7, and phosphine oxide 5. This result indicates 
that the decomposition proceeds with the in- 
tervention of an alcohol [Fig. 2B, (w = 1)] and 
thus starts before complete loss of stabilizing 
alcohols. The decay gradually slowed down as 
free benzyl alcohol in the medium increased 
each time la-4BnOH decomposed. The ef- 
fects of the phosphonium cation structure and 
the quantity of coordinating alcohol on their 
stability (at 80°C in CgDg) were then assessed 
to regulate the stability/decomposition balance 
(Fig. 2C). 
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Fig. 1. Chain propagation/backbiting equilibrium. (A) Cylic oligosiloxanes of 
different sizes (D3.7). (B) Reversible processes: Propagation through ROP of cyclic 
oligosiloxanes and backbiting of silanolate chain end. (C) Typical kinetic curves 
for the kinetically and thermodynamically controlled ROPs of cyclic oligosiloxanes 
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(D3); dashed line indicates Dy (12). (D) Strongly favored propagation process due 
to the hampered backbiting by the alcohol coordination on the silanolate end. 

(E) Kinetic curves for the ROPs of cyclic oligosiloxanes (D3 and D4) improved by the 
effect of alcohols and of self-degradable phosphonium counter cation. 


As mentioned above, the decomposition of 
phosphonium alkoxides 1-xBnOH starts with 
the nucleophilic attack of alkoxide on the cen- 
tral P atom of phosphonium cation (Fig. 2B). 
Therefore, the stability should be strongly re- 
lated to the steric hindrance around the P atom. 
Indeed, an enhanced persistence of BnOH- 
stabilized phosphonium alkoxides 1a,b,c- 
4BnOH with bulkier amino groups on the 
P atom (la < Ib < 1c) was clearly shown by 
gradual slowdown of decomposition (Fig. 2C). 
A strong acceleration of decomposition was 
observed for monocoordinated phosphonium 
alkoxides 1a,b-1BnOH (fg0°c) < 5 min), 
which is critical for using them as smart self- 

quenching systems automatically triggered by 
the loss of coordinating alcohols. Alcohol-free 
phosphonium alkoxides 1la,b-OROH decom- 
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posed at almost the same rate (4y2go°@ < 5 min) 
but with a poor selectivity due to the excessive 
reactivity of nonstabilized alkoxide ion. In this 
case, in addition to the phosphine oxide 5, the 
formation of phosphines was also observed. 
The decomposition of phosphonium alkoxides 
through f elimination triggered by deproto- 
nation at the B position of the substituent by the 
alkoxide to give a phosphine and an alcohol is 
known (23, 24). High stability was observed for 
the most-hindered variant, 1c-4BnOH, with 
piperidino substituents, and it was not possible 
to estimate its half-life because of the exceed- 
ingly slow decomposition rate (4% of decom- 
position at 80°C in 24 hours). Phosphonium 
alkoxide Ie remained persistent, with a reduced 
number of coordinating alcohols (Ic-1BnOH, 
28% of decomposition in 24 hours), even in the 
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absence of stabilizing alcohols (lc-OBnOH, 
60% of decomposition in 24 hours), and thus 
it was not suitable for our purpose. 

The ROP of D, at 80°C in the presence of the 
BnOH-stabilized phosphonium benzyloxide Ib- 
4BnOH (0.5 mol % versus D,) as the initiator led 
to complete conversion in 18 hours to produce a 
clean silicone polymer (99.6%) with only a small 
amount of D, (0.4%) (Fig. 3A), as determined by 
1H- and ?°Si-nuclear magnetic resonance (NMR) 
analysis of the resulting mixture. Despite the un- 
usually high conversion, the polymerization was 
thermodynamically controlled, as suggested by 
the large polydispersity index of the resulting poly- 
mers (J, = 1.54) (25, 26). The NMR analysis 
demonstrated that the polymer chains are 
end capped with the neutral Si-OBn group, 
which would be expected to form through the 
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Fig. 2. Synthesis and stability assessment of 
initiators. (A) Synthesis of alcohol-stabilized 
(i-propyl)(tris-amino)phosphonium alkoxides 1-xROH. 
(B) Proposed mechanism for the decomposition of 
phosphonium alkoxides 1-xROH (x = 0, 1). (C) Effects 
of phosphonium cation (size) and of coordinating 
alcohols on the stability of phosphonium benzyloxides 
1-xBnOH. Half-life time of 1-xBnOH (x = 0, 1, or 4) 

in CgDg (M = 0.07 mol/liter) at 80°C. Half-life of 
1-xROH was determined by “!P-NMR. 


decomposition of ionic chain ends (wv = 1; Fig. 2B). 
After 18 hours, the system was totally de- 
activated, as evidenced by unchanged poly- 
mer size even after heating at 80°C for a 
prolonged time (36 hours), as well as by the 
absence of any further polymerizations upon 
adding another equivalent of D, to the mixture. 

As expected, the stability of phosphonium 
alkoxides substantially influenced the ROP of 
D, (Fig. 3A). Indeed, the least-hindered cation, 
1a-4BnOH, was not persistent enough to com- 
plete the ROP (10% silicone and 90% unreacted 
D,). The ROP with the excessively stable initia- 
tor 1c-4BnOH (persistent even in the absence 
of stabilizing alcohols) led to a conventional 
polymer/cyclic oligomer mixture (polymer/D,.7 = 
83/17). These results clearly confirm the impor- 
tance of an automatic quench system triggered, 
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A Synthesis of ROH-free and ROH-stabilized phosphonium alkoxides 
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before backbiting begins, by the loss of alcohols 
to benefit from the effect of alcohol coordination 
to prevent the cyclic oligomer formation. 

The size of the resulting polymer can be 
roughly regulated by the amount of alcohol 
added (~1/5 of added ROH can effectively 
give rise to polymer chains) (Fig. 3A) (27). 
Therefore, the loss of coordinating alcohols 
should result mainly from dissociation rather 
than from chain transfer reactions. According to 
density functional theory calculations on the mod- 
el reaction [Me;Si-(OSiMe,),-O (BnOH),1a*— 
Me,Si-(OSiMe,),-O (BnOH),.;1a* + BnOH, where 
x = 1 to 4], the Gibbs free energy for the dis- 
sociation of benzyl alcohol (BnOH) from the sila- 
nolate chain end is small for x = 4 (1.6 kcal/mol) 
and x = 3 (5.0 kcal/mol), whereas it is larger for 
x = 2 (118 kcal/mol) and & = 1 (15.2 kcal/mol). 
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More precise polymer size control can be achieved 
using a disiloxane end-capping reagent, 1,3- 
divinyl(tetramethyl)siloxane (M.™), without 
losing the full conversion character of ROP. 
Indeed, varying the amounts of M.™ (2.5 to 
10 equiv.) relative to Ib-4BnOH, the end-capped 
silicone polymers with a Mn close to the expected 
value (Mnyne.) have been obtained with an 
excellent yield (>99%) (Fig. 3B). 
Polydimethylsiloxane (PDMS)-based initia- 
tors (28, 29) are common tools for increasing 
initiator solubility and minimizing impurities 
in silicone synthesis. Therefore, we consid- 
ered preparing a PDMS-based initiator using 
short o,m-dihydroxy-polydimethylsiloxanes 
[PDMSOH = HO(Me,SiO);H] instead of al- 
cohols. The addition of PDMSOH (5 equiv.) to 
phosphonium ylide 2b affords the PDMS-based 
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A_ ROP of D, using different initiators (1a,b,c-xBnOH) 


1-xROH (0.5 mol %) | | | | 
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80°C, 18h | the 


1a,b,c-xBnOH (Effect of phosphonium cation) 
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Fig. 3. Fully converted anionic ROP of Dg. (A) Study of the effects of phosphonium cations on the ROP of 
D, with different initiators 1-xROH. (B) Study of polymer size control by the end-capping agent Mp” in the 
ROP of D, with the phosphonium benzyloxide initiator b-4BnOH. (C) Synthesis of the PDMS-based initiator 
1b-4PDMSOH and its tests for the full ROPs of D4 in the presence of Mv. 


initiator Ib-4PDMSOH, which was stable at 
room temperature and decomposed at 80°C in 
2 hours (é/2 = 15 min) (Fig. 3C). The ROP of D4 
with this initiator (1.0 mol %) in the presence of 
M™ (2.5 or 5 equiv. versus Ib-4PDMSOH) af: 
forded silicones with yields >99% and with the 
expected polymer sizes (Fig. 3C). After the reac- 
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tion, no traces of short oligomer HO(Me,SiO)g,H 
could be detected by NMR and gel-permeation 
chromatography analysis, probably because of 
its consumption by polycondensation processes. 
These results demonstrate that the silanol func- 
tion (Si-OH) in PDMS can also act as a stabil- 
izer of anionic chain ends to prevent the cyclic 
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oligomer formation. We conclude that the full 
ROP of D, is actually achievable. 
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The cost of being a student 


n my first semester as a teaching assistant, a student came to me with a sensitive request. The 
professor required students to use electronic clickers to answer multiple-choice questions during 
the lecture; participation would count toward their grade. But this student confided that she could 
not afford the $50 for a clicker and $30 to register it. She offered to write her answers on paper and 
give them to me after every lecture, or just accept a 0% for attendance, knowing that she would 
need to excel in other aspects of the class to make up for it. She was proud of her work ethic and 
sure she belonged in academia. She reminded me of myself—and of our responsibility as educators to 


support poor students without singling them out. 


For my bachelor’s, I had chosen to 
attend the school where I received 
the most financial aid, which was 
a private university. I felt privi- 
leged to study there. But I was 
surrounded by people who could 
easily afford it, and I struggled with 
feelings of isolation and insecurity. 
I felt I needed to hide my lower 
socioeconomic status to fit in. I lied 
about the many hours I worked at 
part-time jobs, why I couldn’t go 
to the movies, and why I couldn’t 
ask my family for money. 

Most professors didn’t help 
much, often requiring expensive 
textbooks and class materials with 
seemingly little thought to the bur- 
den this placed on some students. 
Sometimes I could find the books 
at the library or buy less expen- 
sive used versions. And sometimes 
I sought accommodations from 
professors, even though it felt uncomfortable to share that I 
needed them. But it didn’t always go well. When I asked my 
organic chemistry professor, for example, whether I could buy 
an older version of the textbook, he tersely said no; only the 
new textbook had the access code for the homework software. 
As he talked, I did the mental math of how many extra hours I 
would need to work to pay for the $350 textbook. 

There were rare exceptions, such as my calculus profes- 
sor. On the same day as my conversation with the organic 
chemistry professor, he gave me a much-appreciated boost 
when he loaned the entire class his own textbooks for the 
semester. Other students may have turned up their noses at 
the books’ tattered condition, but I was incredibly grateful— 
both to be saved the expense, and to not be set apart from 
the rest of the students. His gesture, and similar efforts by a 
handful of faculty and staff, helped me make it through my 
undergraduate and master’s studies. 


“| realized sharing my earlier 
struggles could help low-income 
students feel less alone.” 


When I learned Ph.D. students 
are typically paid a stipend, I was 
delighted. And once I started grad- 
uate work, my financial struggles 
dissipated. Many students found 
the stipend meager, but I was an 
expert at making ends meet, and I 
felt my classmates and I were now 
in the same boat. I finally felt I be- 
longed in higher education. 

When that student told me she 
couldn’t afford a clicker, though, I 
realized sharing my earlier strug- 
gles could help low-income stu- 
dents feel less alone. I told her I 
had once been in her shoes. And, of 
course, I let her submit her answers 
on paper and manually entered her 
attendance into the gradebook. 

In the years since then, I’ve 
continued to talk about my expe- 
riences when they seem relevant. 
I still feel vulnerable opening up 
in classes where the majority of students comfortably af- 
ford college, but it has been incredibly rewarding. Many 
students have now shared their financial struggles with me. 
This past semester, for example, one of my most diligent 
students confided that he worked long hours in the univer- 
sity’s cafeteria and needed to do the bulk of his coursework 
late into the night. So, when he occasionally fell asleep in 
class, I knew it reflected anything but lack of interest, com- 
mitment, or ability. 

All low-income students should have mentors who can help 
them cope with the many expenses associated with college 
and make them feel welcome. And I hope that more academ- 
ics can be like my calculus professor, who with his simple ges- 
ture made such a big difference for me. 


Faye Harwell recently received her Ph.D. from Boston University. Send 
your career story to SciCareerEditor@aaas.org. 
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Flexible Positive Pressure Sensors 

Tecan has announced the launch of the Resolvex 
A200 Omics and Resolvex A200 24, the next 
generation of Resolvex A200 positive pressure 
processors. The Resolvex A200 is a standalone 
system for automated positive pressure sample 
preparation which is designed to address unmet 
workflow needs by improving the flexibility and 
efficiency of the sample preparation process. The 
Resolvex A200 Omics incorporates a HEPA filter 
and fully enclosed environment to reduce sample 
contamination in highly sensitive mass spectrometry work. The 
Resolvex A200 24 is compatible with larger volume consumables 
including 3 and 6 ml columns and 24-well plates. The system is 
also available with accessories and adapters that allow users to 
run more workflows, such as giga-sized nucleic acid purification 
methods. Dr. Klaus Lun, executive vice president and head of the 
life sciences division at Tecan, commented: “We are excited to 

be launching the Resolvex A200 Omics and 24, additions to the 
Resolvex A200 family that give scientists the ability to perform 
various workflows—from proteomics and bioanalytical processes 
to NAP and food safety workflows—with even better accuracy and 
throughput.” 

Tecan 

For info: +44 (0) 1480 405333 
www.tecan.com/resolvex-a200-next-generation-positive-pressure- 
processing 


Artificial Liver Model With Predictive Drug Clearance Simulation 
British biotech firm CN Bio has released the PhysioMimix™ Single- 
organ Higher Throughput System, adding to its product line of 
microphysiological systems that simulate the function of human 
organs for medical research purposes. Designed to overcome 

the barriers to widespread adoption of predictive human liver 
models, the PhysioMimix HT has a significantly reduced cost-per- 
chip and greater run capacity that makes it effective earlier in the 
process of drug development and approval. Along with it, CN Bio 
is also releasing the Liver-48 multi-chip consumable plate, which 
miniaturizes the company’s human liver model for applications 
that include predicting drug-induced liver injury or determining 
drug metabolism/hepatic clearance, and the modelling of various 


prevalent liver diseases such as non-alcoholic steatohepatitis (NASH). 


CN Bio 
For info: +44(0)7891477378 
https://cn-bio.com/physiomimix-ooc/ 


Configurable Robotic Platform For Advanced Therapy 
Manufacturing 

Despite the therapeutic potential offered by cell and gene therapies 
and the rapid growth of the sector, these treatments remain 

largely unavailable to patients worldwide. Because large-scale 
manufacturing capability lags far behind therapeutic innovation, 

it can take years before newly discovered advanced therapies 
become accessible to those who need them. Cellular Origins has 
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developed the Constellation platform to bridge that gap, providing 
therapy developers access to sterile manufacturing at scale. 
Combining highly flexible, autonomous mobile robots, proprietary 
tube welding technology, and automated tube management, 
Constellation is capable of precise and sterile fluid transfer. A 
configurable console system can be built around the individual 
needs of each manufacturing facility and the therapy they produce. 
This combination provides end-to-end digital interconnection, 
including quality control systems and fully remote digital control and 
analytics. 

Cellular Origins 

For info: +44 (0)7771 730919 

https://cellularorigins.com/ 


Modular flow chemistry system 

The Uniqsis FlowLab Plus™ is a flow chemistry system with a 

wide choice of modules to match different applications. Designed 
around the Binary Pump™ dual channel reagent delivery system, 
the FlowLab Plus™ can be configured to run both manual and 
automated flow chemistry reactions. The system may be configured 
with any combination of up to four individual reactor modules. With 
the combined HotCoil™ and HotChip™ heated reactor modules 
alongside the Polar Bear Plus Flow™ or Polar Bear Plus GSM™ 
cryogenic modules, an operating range from -40 °C to 300 °C is 
possible. The addition of FlowControl II software allows you to 
integrate your FlowLab Plus™ with a fraction collector, enabling you 
to automate reactions from either reagent bottles or automated 
sample loops for reaction optimization studies. The computer, 
pumps, and heating/cooling modules in a FlowLab Plus™ are 
connected over a LAN using an Ethernet hub, enabling the system to 
be controlled remotely by Wi-Fi, and allowing the control computer 
to be operated outside a fume hood. 

Uniqsis Ltd 

For info: +44-1223 942004 

www.uniqsis.com 


INTEGRA Biosciences streamlines serial dilutions 

Serial dilutions are widely used across many life science disciplines 
and performing them correctly underpins the success of later 
studies. INTEGRA Biosciences’ electronic pipettes, automation 
platforms, and peristaltic pumps are designed to ensure 
consistent liquid handling and minimize human error, therefore 
increasing throughput and reproducibility of serial dilutions. These 
instruments are adaptable for applications that simplify a range 
of liquid handling workflows and enhance lab productivity. These 
liquid handling solutions can be used to standardize and streamline 
pipetting protocols in a variety of applications, including standard 
curve preparation to quantify nucleic acid concentration and 
validate qPCR assays, minimum inhibitory concentration testing in 
healthcare and drug development, and bacterial enumeration for 
epidemiological studies 

INTEGRA 

For info: +44 (0)1480 405333 

https://www.integra-biosciences.com 
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